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Editor’s Introduction 


I am truly honored to be selected as the eighth editor of Geographical Analysis. At the same 
time, these are very big shoes to fill, as pointed out to me by colleagues. The journal is now be- 
ginning volume 35, and continues to publish 4 issues each year. Geographical Analysis (GA) 
has come to be the premier journal focusing on geographical theory, modeling, and quantita- 
tive methods. This is certainly a noteworthy achievement, and sustaining this stature within the 
discipline will depend on the submission and publication of rigorous and innovative research. 

The journal was started back in 1969 in the Department of Geography at The Ohio State 
University. It has been edited by faculty in this department, most recently, since 1991. Rough 
estimates show that there have been over 520 submissions since 1990, out of which approxi- 
mately 250 have been published. Reviews for these submissions have been provided by some 
400 different scholars. The rate of acceptance over this period is about 50%. As the number of 
publications per year has remained somewhat consistent, it would appear that there have been 
well over 700 published articles in GA requiring the attention of perhaps more than 1000 ex- 
perts spanning a range of specialties and disciplines. Viewed from this perspective, the impact 
of GA has been substantial. This undoubtedly reflects the relatively high impact factor rating of 
the journal (1.024 according to the 2000 rating of the Journal Citation Reports produced by the 
Institute of Scientific Information). 

The journal, and all that is associated with it, is the result of quality submissions that have 
been subject to the highest standards of peer review. In addition, the past editor Morton 
O'Kelly (1996-2002) and those that have served before him (Les King 1969-73, Reg Golledge 
1974-78, Ross MacKinnon 1979-84, Bill Clark 1985-87, Mike Goodchild 1988-90, and 
Emilio Casetti 1991-95) are to be commended for effectively guiding and positioning the jour- 
nal, resulting in the high regard and respect for GA in Geography and associated disciplines. 
On behalf of the previous editors, I would like to also thank past Associate Editors (10 thus far, 
not including the three new additions) and Consulting Editors (94 thus far, not including the 
15 new additions) for their invaluable service contributions to the journal: 


Associate Editors 


R.G. Golledge (1/71-1/74) R.J. Bennett (1/85-10/87 
D.R. Reynolds (1/74-10/78 John P. Jones HI (1/91-10/96) 
J.N. Rayner (7/77-L0/78 Robert Haining (1/96-10/02) 
Emilio Casetti (1/79-10/84 Kavita Pandit (4/97—10/02 
Morton O'Kelly (1/85-10/95 Alan T. Murray (7/01—10/02 


Consulting Editors 


H.J. McCarty (1/69-10/74 Hudson (1/74—10/82) 

J.D. Nystuen (1/69-10/74 L.J. King (1/74—10/90 

G. Torngyist (1/69-10/74 W.R. Tobler (1/74-10/95 

W. Warntz (1/69-10/74 B.J. Garner (7/76-10/84 

A. Wrobel (1/69-10/74 Jettrey P. Osleeb (10/77-10/78) 
B.].L Berry (1/69-10/78 D R. Reynolds (1/79-7/81) 
G.H. Dury (1/69-10/78 S. Gale (1/79-10/82) 

P.R. Gould (1/69-10/78) RS Jarvis 1/79-10/84) 

A.L. Mabogunije (1/69-—10/78) B. Marchand (1/79-10/84 
D.F. Marble (1/69-10/78 N. Alao (1/79-10/87) 

J. Wolpert (1/69-10/78 R.G. Golledge (1/79-10/87) 
R.H.T. Smith (1/69-10/79) Y.Y. Papageorgiou (1/79-10/87) 
P. Haggett ( 1/69-7/S1 P. Korcelli (1/79-10/90 

L.. Curry (1/69—10/87) John Rayner (i/79-10/00) 
D.W. Harvey (7/69-10/78 A.J. Arnfield (1/80—10/84) 

G. Olsson (7/69-10/87 E.S. Sheppard (1/80-10/90) 
Y.V. Medvedkov (1/69-10/78) G. Leonardi (10/81-10/84) 

Z. Chojnicki (1/74-10/78) J. Thornes (10/81-10/84) 

O. Warnervd (1/74-10/78) Robert Haining (10/81—10/95) 
F. Lukermann (1/74-10/79) John Odland (10/81-10/02) 
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D.A. Griffith (1/S3—10/90 Pat Gober (1/91-10/02 

Pipkin (1/83-10/90 Luc Anselin (1/91-present 

].M. Batty (1/S5—10/S7 Michael F. Goodchild (1/91-present 
H. Beguin (1/85-10/S7 Kingsley E. Haynes (1/91-present 
R.D. MacKinnon (7/78—L0/ST Geottrey ].D. Hewings (1/91 present 
D.F. Batten (1/S5—10/90 Arthur Getis (7/91-present 

E.G. Moore ( 1/74—10/90 C. Werner (7/76-10/95 

H. Timmermans ( 1/S5—10/90 Nina S-N. Lam (10/93-present 
Manfred M. Fischer (1/85 present Kelvyn Jones (4/94—-10/02 

R.]. Bennett (1/79—10/90 Avse Can Talen (1/96-10/02 

A. Ghosh (1/SS—10/90 Emilio Casetti (1/96—10/02 

Y. Leung (1/SS-— 10/90 Helen Couclelis (1/96—-10/02 
D.M. Mark (1/SS—10/90 Adrian Esparza (1/96-10/02 

D. Pumain ( 1/SS—10/90 Hee-Yeon Lee (1/96—-10/02 

ReVelle (1/88—10/90 Brigitte Waldorf (1/96—10/02 

LE. Smith ( 1/SS—10/90 Cort Willmott (1/96—10/02 

].-F. Thisse ( 1/SS—10/90 Randall W Jackson 1/96—10/02 
David Plane (1/SS present Harvev Miller (1/96 present 

M.]. Webber ( 1/74-10/90 Atsuvuki Okabe (1/96 present 
William A.V. Clark ( 1/79-10/02 Kavita Pandit (1/96—present 

D.S. Dendrinos ( 1/91—10/95 Sergio Rey (1/96 present 

J.C. Knox (1/91-10/95 Jean-( laude Thill (1/96 present 
N. Wrigley (1/SS—10/95 John P. Jones (10/97—10/02 
Anthol D. Abrahams (1/91—10/02 Deborah Brvan (1/98-—10/02 
Lawrence A. Brown (1/91-—10/02 Mei-Po Kwan ( 10/00—10/02 

A. Stewart Fotheringham (1/91-10/02 James P. Lesage (10/00—present 


Howard L. Gauthier (1/91-10/02 Brian Mikelbank (1/01-10/02 


I should note that some of the above service periods for Consulting Editors have not been 
consecutive 


We have assembled an exceptional board to help maintain the high standards of the journal 
as well as to represent the changing nature of geographical theory. | am grateful to the Associ- 
ate Editors, Consulting Editors and Executive Committee (all acknowledged on the inside 
cover of each journal issue) for their dedication to and support of the journal. I plan to rely ex- 
tensively on the knowledge and expertise of the editorial board in guiding the journal over the 
conung vears 

As editor, a major objective will be to continue to maintain, and hopefully build upon, the 
success of past editors of GA. We are soliciting and encouraging submissions that contribute to 
the continued ce velopment of theoretical geography We hope that published research in the 
coming years continues to build upon, extend and evolve theory and methods applied in a wide 
variety of geographical contexts, both physical and human 

The journal will entertain “Research Notes and Comments” submissions, which appear to 
have subsided in 1993, and “Methodological Discussions” explored in 1983. Further, we hope 
to publish one mnovative Spec ial Issue” each vear as has been done sporadically in the past 

Spatial Temporal Dynamics” in 1981; “Location Analysis and Modeling” in 1990; “Spatial 
Modeling” in 1992: and “The Multinodal Metropolis in 1993). By the way copies of these and 
all other back issues of GA have now been converted to electronic format 

An operational goal is to streamline the reviewing process, to the extent this is possible, by 
encouraging electronic submission of papers currently we are accepting either MS Word or 
PDF formatted submissions). This will facilitate the distribution process, both to the journal 
and to referees. A source of pride is certainly that GA provides timely and thoughtful reviews, 
so we hope to maintain this record 

The journal is only as good as the submissions it receives and the subsequent quality refer- 
eeing that is applied to these submissions. | look forward to facilitating the publication of the 
very best of geographical theory, modeling, and quantitative methods in GA in the coming 
years 


Alan Murray 
Editor, Geographical Analysis 


ga@ geography ohio-state.edu 
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Ron Johnston 
Charles Pattie 


Evaluating an Entropy-Maximizing Solution 
to the Ecological Inference Problem: 
Split-Ticket Voting in New Zealand, 1999 


For geographers, one important aspect of the ecological inference problem relates to 
the analysis of spatial variations in individual behavior. Obtaining estimates of this 
behavior, in the absence of direct data, is often difficult, and the “solution” to the eco- 
logical inference problem propounded by G. King is not relevant in all circumstances. 
An alternative, using a different approach, has been used for some time in electoral 
studies but has lacked “real” data against which to assess the accuracy of its estimates. 
The availability of such data for New Zealand's 1999 general election allows such an 
assessment to be made—wvith very favourable results. The estimates are then used to 
test hypotheses regarding the spatial variation in split-ticket voting, again with con- 
siderable success. 


The problems of ecological inference and fallacies present major constraints to the 
analysis of spatial variations in a wide range of individual behaviors. In many situa- 
tions, no data are available with which to map such (supposed?) variations, whereas in 
others the only available data refer to population aggregates, analysis of which pro- 
vides circumstantial evidence only of the spatial variations of interest. 

Over the decades since the nature of these problems was first recognized, social 
scientists have advanced a variety of methods for tackling them. In 1997, however, 
King (1997) claimed to have solved the problem with a procedure that produces esti- 
mates of the unknown quantities and thereby provides the needed information for 
analyzing spatial variations. That procedure has drawbacks, however, particularly 
with regard to multi-state situations (i.e., where the behavior being considered is 
more than binary in the number of states), hence continued work on alternatives (as 
in King, Rosen, and Tanner 1999: Rosen et al. 2001). Furthermore, although King’s 


The split-ticket voting matrix in Table 1 was produced by the New Zealand Electoral Study 1999 
http:/Avww.nzes.org/1999/splitvote.html). We are grateful to Professor Jack Vowles of the Department of 
Politics, University of Auckland, for this and other data, and the New Zealand Electoral Commission for 
providing the spending data 


Ron Johnston is a professor in the School of Geographical Sciences, University of Bristol. 
Email: r.johnston@bristol.ac.uk. Charles Pattie is a professor in the Department of Geography, 


University of Sheffield. E-Mail: c.pattie@sheffield.ac.uk 


Geographical Analysis, Vol. 35, No. 1 (January 2003) The Ohio State University 
Submitted: March 21, 2002. Revised version accepted: June 27, 2002. 
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method has been employed to produce estimates of spatial variations in patterns of 
behavior—as in Burden and Kimball's (1995) on ticket-splitting at U.S. federal elec- 
tions—the focus has been almost entirely on estimating the internal cells of 2 x 2 ma- 
trices, with little or no reference to those cells in the set of sub-matrices from which 
the overall estimates are derived. (See, for example, Kousser 2001, and other papers 
in his special issue on ecological inference in historical studies. ) 

One of those alternatives is entropy-maximizing (EMax), a procedure first intro- 
duced to geographers more than three decades ago (Wilson 1967, 1970). It has been 
adapted to tackle particular forms of the ecological inference problem—amainly in the 
study of voting behavior (as first set out in Johnston and Hay 1982, 1983; see also 
Johnston, Pattie, and Allsopp 19588). One drawback with its application to date, how- 
ever, has been the lack of data against which to check the accuracy of its estimated 
values. This paper reports on an application—split-ticket voting at New Zealand's 
1999 general election—for which excellent data to prov ide such a check are available. 
It first establishes the goodness-of-fit between the actual and estimated spatial varia- 
tion in the variables of interest, and then uses the estimated values to analyze the na- 
ture and co-variates of that variation 


THE PROBLEM 


The nature of the geographical-ecological inference problem is illustrated by Fig- 
ures 1 and 2 


The Empty-Cell Situation 


Figure | shows the situation in which the analyst knows nothing of the link be- 
tween the two variables involved—what we term the empty-cell situation. For each of 
a number of areas, we know the sum of individuals for each row and column of the 
data matrices (indicated by the letter K in the figure), but have no information about 
the values for their internal cells (indicated by the letter U). Therefore any analyses of 
spatial variations can address the question of “are there differences in the size of the 
cell values from place to place?” through studies of the relevant row and column 
sums only—the so-called method of ecological correlation. Similarly, we know only 
the row and column totals for the summed matrix, shown on the right-hand side o 
the figure (i.e., the sums of all the individual area matrices 

As an example of this situation, assume that the rows in each matrix in the column 
refer to individuals’ social class and the columns to the party they voted for at a given 
election. Regressing, say, the percentage of the votes cast for party I against the per- 
centage of the population in class A may provide circumstantial evidence of the 
strength of support for party I among class A: if there is a strong positive relationship, 
then the more members of class A in an area the more votes there are for party [— 
from which we may draw the ecological inference that members of class A tend to 
vote for party | 

But that is an inference only; it could be quite feasible to find relatively high corre- 
lations between the two variables at the ecological scale from regressions involving 
the row and sum columns, but only small percentages of the row totals in the relevant 
diagonal cell of the data matrix. Thus party | may have obtained 40% of the votes cast 
in an area with 45% of its population in class A, and 60% of the votes in another area 
where 55% are in class A. But the proportion of those in class A who voted for party I 
may vary considerably across the areas. Because party I tends to perform well in areas 
where class A is large does not nece ssarily imply that party I gets much of its support 
from class A, or that most members of class A tend to vote for party lL. 

This issue of reconciling aggregate and individual patterns—is one aspect of the 
widely appreciated ecological fallacy: can we infer patterns at one scale (that of the 
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individual) from those at another (the aggregate)? Over the five decades since Robin- 
son (1950) and Goodman (1959) first raised the issue, social scientists have focused a 
great deal of attention to answering this question.' One of the most recent at- 
tempts—which has attracted a great deal of attention (not least among geographers: 
see Sui et al. 2000)—was that of Gary King in his book A Solution to the Ecological 
Inference Problem (King 1997) 

King’s particular purpose was to estimate the cell values for the summed matrix, 
shown on the right-hand side of Figure 1 (in large part because he was addressing the 
problems of research trying to produce estimates of those cell values for court cases 
that were reliant on estimates of various ecological relationships, such as the propor- 
tion of blacks and whites who vote at U.S. elections). Using a combination of two pre- 
viously suggested procedures (the method of bounds and the method of moments: 
see Kousser 2001; Lewis 2001), his method produced what he claimed were more re- 
liable estimates than those produced by others—and also more “sensible” in that be- 
cause the method of bounds was involved it was impossible to produce nonsense 
estimates (such as that 105% of those who voted for party I at the first election did so 
again at the second). 

King’s focus was on the cells of the summed matrix but one of the by-products of 
his method is that it also produces estimates of the cell values in each of the matrices 
for the separate areas so that, for example, one could obtain estimates not only of the 
percentage of those voting for | among class A for the entire area (the right-hand ma- 
trix), but also for each of its constituent sub-areas (the left-hand matrices). This has 
allowed analyses of spatial variations in such estimates across the sub-areas, as in Bur- 
den and Kimball (1998) work on split-ticket voting at the 1988 U.S. federal elections 
and O’Loughlin’s (2000) on voting for the Nazi Party in 1930s Germany. 
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Fic. 1. The Ecological Inference Problem: the Empty-cell Situation. 
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King’s book has attracted a great deal of attention because of its claims to solve a 
major problem in quantitative social science—though its qualities have not been 
universally accepted by critics (see, for example, Cho 1998; Freedman, Klein, and 
Ostland 1998; and the reply of King (1999). For geographers interested in spatial 
variations in the cell values of ecological matrices, it offers substantial possibilities. 
But it also has drawbacks, two of which are relevant to the current discussion: 1) it is 
basically only applicable to 2 x 2 matrices, and applications to larger problems (such 
as the 4.x 4 matrices shown in Figure 1) are fraught with difficulty; and 2) it doesn't 
allow for constraints other than those of the row and column totals in each sub-area. 
For situations involving both of these, an alternative approach to estimating spatial 
variations is desirable. 


The Sum-of-Cells Situation 


Such an alternative approach has been available for some time (Johnston and Hay 
1982, 1983), though the accuracy of its estimates has not been extensively evaluate od. 
King (1997) was able to evaluate the accuracy of his estimates—at the global level of 
the matrix on the right-hand side of Figure 1—because he had the relevant data. This 
section sets out that alternative method—what we term the sum-of-cells situation: a 
later section provides by far the strongest evidence yet available of its precision and 
accuracy 

The nature of this situation is exemplified in Figure 2, which, like Figure 1, gives 
the known (K) and unknown (U) values in the same ecological problem. The clear 
difference between them is that in Figure 2 the internal cell values are known for the 
global matrix (on the right-hand side). These provide extra constraints to the estimat- 
ing procedure, additional to the row and column sums employed by King. Whereas 
King’s ecological inference (EI) method answers the question, What is the most likely 
value in each cell in each matrix (sub-area and global), given the constraint that those 
cell values must sum to the relevant row and column totals? 
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The question addressed with the procedure employed here (EMax) is What is the 
most likely value in each cell of each sub-area matrix, given the constraints that those 
cell values must sum not only to the relevant row ond column totals but also to the 
relevant cell total in the global matrix (the face sum in a three-dimensional data 
cube)? Thus, to return to the example, whereas King’s EI method estimates the 
global number of party I's supporters who are also in class A, and as a by-product ob- 
tains estimates of spatial variations in that figure across all of the sub-area matrices, 
EMax estimates the latter given knowledge of the global sum. It fills the empty cells 
in such a way that they sum to the known global values: it estimates geographical vari- 
ations in a known rather than an unknown quantity. 

The EMax procedure was introduced to geographers, for example, by Alan Wilson 
(1970) in his work on transport flows. Its ads aptation to problems in e ‘lectoral geogra- 
phy has been set out on a number of occasions (as in Johnston and Pattie 2000).? 
EMax produces the maximum-likelihood estimates for each of the unknown cell val- 
ues, given the constraints. Using the example in Figure 2: 

¢ the cell values in each row of each sub-area matrix must sum to the known row 
total for that matrix: 
the cell values in each column of each sub-area matrix must sum to the known 
column total for that matrix; and 
the values for each cell across all sub-area matrices must sum to the value of that 
cell in the global matrix. 


Estimates of the number of party I's voters in class A in each sub-area are therefore 
consistent not only with the pattern of voting and class composition there but also 
with the total number of party I voters in class A across all sub-areas. 


Applying the EMax Procedure 


The nature of the EMax procedure has been set out in a number of papers (e.g., 
Johnston and Pattie 2000), and so will not be reproduced here. It employs an iterative 
procedure to identify the npepecoeny! v4 individuals across the cells in the sub-area 
matrices that is most likely to occur (i.e., the maximum- entropy solution) given the 
constraints of the row, column, and fac ‘e sums of the three-dimensional data cube. 
Those are the maximum likelihood estimates, the most likely distribution given the 
absence of any other constraints. Fitting involves using the iterative proportional fit- 
ting procedure first set out by Deming and Stephan (1940; see also Mosteller 1968). 


THE ACCURACY OF EMAX ESTIMATES 


Although EMax has been used for some two decades to produce estimates of the 
geography of a variety of aspects of voting behavior, one of the drawbacks of these ap- 
plications has been the lack of corroborative evidence regarding the accuracy of the 
estimates. (Until recently, only one data set was identified which allowed even limited 
testing: Johnston, Hay, and Rumley 1983, 1984.) Analysts (and their readers) have 
had to take on trust that the estimates produced are reliable—i.e., are accurate rep- 
resentations of the “real” (unknown) values—and the veracity of analyses of the geo- 
graphies so revealed have similarly been based on that trust. (One comparative study 
of the merits of various atte mpts—pre- King—to resolve the EI problem did re port 
favorably on it —Cleave, Brown, and Payne 1995; and it was applied by one other re- 
searcher: Berg 1988.) The availability of a new data set allows a much more detailed 


2. Although this literature seems to be largely unknown among geographers: Johnston and Pattie 
(2001). 


~ 
rg 
ie 


6 / Geographical Analysis 


check on the claims than has heretofore been possible; it provides the best-ever test 
of the accuracy of the entropy-maximizing estimates. 


MMP Elections in New Zealand 


After two national referendums, in 1996 New Zealand held its first general election 
to the unicameral national Parliament (the House of Representatives), using a pro- 
portional representation electoral system—Mixed-Member Proportional (MMP). 
(Previous elections had used the first-past-the-post system in single-member con- 
stituencies.) This system involves what are, in eflect, two elections held contempora- 
neously. Electors have two votes: 1) a constituency vote, to elect a single MP to 
represent their local constituency, in a first-past-the-post contest—at the 1999 gen- 
eral election there were 67 such constituencies; and 2) a list vote, to elect MPs from a 
particular party—at the 1999 general election, there were 53 such MPs. 

The procedure then works as follows: 


1. 67 MPs are elected to the House of Representatives from the single-member 

constituencies; 

The percentage of the list votes won by each party is calculated, to indicate the 

overall percentage of seats in the House to which each is entitled: entitlement is 

constrained to parties which either get 5% of the list votes nationally or win one 
or more of the 67 constituency seats; 

3. The party composition of the House overall is thus determined (i.e., the 120 
seats), based on the percentages calculated for the eligible parties under step 2; 
For each party, the number of seats won in the single-member contests (step 1) 
is subtracted from its total allocation of seats (ste Pp: 3), and the re maining seats to 
which it is entitled are allocated from the list. 


2. 


The list votes are crucial to determining the composition of Parliament, and so to the 
nature of the government. Members of the electorate should thus cast their list vote 
for their preferred party (what is known as their sincere vote): the more votes it gets 
in this contest, the more seats it will get in Parliament, irrespective of its number of 
victories in the constituency contests. Their constituency vote need not be for a can- 
didate from that party however: they may decide to vote for a candidate from another 
party—perhaps because of her/his individual characteristics or because the v believe a 
constituency vote for the candidate of their preferred party in the list contest would 
be wasted because he/she has no chance of winning there. There may thus be some 
split-ticket voting across the two contests, with members of the electorate voting for 
different parties in each. 

How much split-ticket voting is there—and does it vary geographically, across the 
67 constituencies? The answer to the first of these questions is given by the Electoral 
Commission, which assembles data from the voting returns on the volume of split- 
ticket voting: in 1996, 37% of the electorate voted a split ticket, as did 35% in 1999. 
The percentage that voted a split ticket varied across the constituencies, however: 
from 31 to 77 in 1996 and from 28 to 73 in 1999. (On the 1996 election and the esti- 
mates of split-ticket voting derived then, see Johnston and Pattie 1999. 

If people vote a split ticket, what is the pattern of the splits—how many vote for 
each pair of parties at the two contests? And do these percentages vary across the 
constituencies? The answer to the first of these questions is provided by sample sur- 
veys of the electorate. The 1999 New Zealand Election Study a ted a large sur- 
vey of 5382 voters, giving the split-ticket voting matrix in Table 1. This shows, for 
example, that of the 428 who voted for the Alliance in the st he st | 144) 34% also 
voted for its candidate in the constituency contest, with a further 209 (49%) switching 
their support to the Labour Party candidate, and only small numbers splitting their 
ticket to support another party. 
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TABLE | 
The New Zealand Election Study Split-ticket Voting Matrix, 1999 


Constituency vote 


A 3 7 22 
ACT 4 93 3 5 "32 226 l 3 
CC 2 2 56 l 19 30 2 12 
G 21 10 l 64 123 21 9 35 
I 200 19 8 78 1750 89 43 134 
N 1S 110 10 13 145 1189 21 85 
NZF 5 l 2 6 2 

O 14 10 1] 55 


Norte: Key to parties: A—Alliance 


O— Other 


ACT—ACT. CC—Christian Coalition, G—Green, L—Labour, N—National, NZF—New Zealand First 


The second question can be addressed using EMax. For each of the 67 constituen- 
cies used at the 1999 general election, we know—from Electoral Commission re- 
turns—the number of votes for each candidate in the constituency contests and for 
each party in the list contest; these form the column and row totals in the sub-area 
matrices in Figure 2. The national split-ticket matrix (Table 1), grossed up from the 
sample size to the total electorate, provides the third set of constraints—the global 
split-ticket matrix. With these three, we used EMax to estimate the unknown cell val- 
ues for each of the 67 constituency matrices. 


The Accuracy of EMax Estimates: Party Loyalists by Constituency 


How accurate are those estimates? The Electoral Commission publishes the total 
number of straight-ticket votes (i.e., votes for the same party at both contests) in each 
constituency—these are the sum of the diagonal values (top-left to bottom-right) in 

each constituency matrix. In addition, it produces counts for the number of straight- 
ticket votes for each party—i.e., for the individual diagonal values in each con- 
stituency. This gives us the actual values for eight of the 64 cell values in each of the 
67 matrices, against which the accurs icy of the EMax estimates can be checked. 

Scatter-plots of the relationships be ‘tween the actual and estimated number of loy- 
alists for each party and for the total are given in Figures 3 through 11. In all but one 
of these, the closeness of the two values in every constituency —and thus of each con- 
stituency to the best-fit linear regression line—is very substantial indeed. Even with 
those parties that performed much better in a small number of constituencies than 
they did in the great majority (Alliance, New Zealand First, Green) the fit is very 
good across all constituencies (and there is no undue leverage on the regression 
caused by the “outliers”). The estimates and the actual values are extremely similar in 
all cases. 

The regression parameters in Table 2 confirm the visual conclusions drawn from 
the nine figure *s. Excluding the one deviant case (other parties: see below), the average 

value of r* ios the seven individual parties is 0.961 (which is also the value of r* for all 
‘ total graph in Figure 11), and the standard deviation is only 
0.03, aden a very tight clustering about the mean—all providing clear evidence of 
very close fits. Further, the a values are all close to zero and the b coefficients close to 
1.0, indicative of a virtual 1:1 ratio between the actual and estimated values. 

The only exception to this overall patte rn of very close fits between estimated and 
actual straight-ticket voting is for “other” parties. The reasons for this are straightfor- 
ward. A consid rable number of small parties stood in the list contests, but many of 
them fielded at most just a few candidates for the constituency contests (several 
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TABLE 2 


Regressions of Estimated Against Actual Numbers of Party Loyalists 


Labour 2 0.913 0.954 
National 2 0.923 0.980 
Alliance 0.SS6 0918 
New Zealand First 0.906 0.988 
ACT O.S73 0.959 
Green 5 0.937 0.963 
Christian Coalition 1.109 0.934 
Other O.S19 0.402 


Total 7 O.SSS 0.961 
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Fic. 3. Actual and Estimated Values of Labour Straight-ticket Voters, by Constituency 
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4. Actual and Estimated Values of National Straight-ticket Voters, by Constituency 
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Fic. 5. Actual and Estimated Values of Alliance Straight-ticket Voters, by Constituency. 
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6. Actual and Estimated Values of New Zealand First Straight-ticket Voters, by Constituency 
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Fic. 8. Actual and Estimated Values of Green Straight-ticket Voters, by Constituency 
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Fic. 9. Actual and Estimated Values of Christian Coalition Straight-ticket Voters, by Constituency. 
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Fic. 10. Actual and Estimated Values of “Other” Straight-ticket Voters, by ( onstituency 
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Fic. 11. Actual and Estimated Values of Total Number of Straight-ticket Voters, by Constituency 
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fielded none). And electors who voted for one of those parties in the list contest may 
well have supported the candidate of a further “other” party in the constituency con- 
test—preferring her/him instead one of the seven “major parties.” 

Overall, these results suggest that a great deal of confidence can be placed in the 
EMax estimates of the cells in the 67 constituency split-ticket matrices. No evidence 
is available to check the veracity of the off-diagonal estimates, but the circumstantial 


evidence from the diagonal values is very clear: they provide very good estimates of 


the number of electors who voted for one party in the list contest but for another in 
the constituency. 


ACCOUNTING FOR THE GEOGRAPHY OF STRAIGHT- AND SPLIT-TICKETS 


The purpose of making these estimates is not simply to map the variation in the 
volume of straight- and split-ticket voting across the constituencies but also to seek 
explanations for those variations. This section reports on preliminary steps leading to- 
wards such accounts, using first scatter diagrams to explore the nature of the ex- 
pected relationships, and then regression analyses to test the veracity of the visual 
interpretations. 


Visual Explorations 


As noted above, split-ticket voting is likely to occur when the constituency candi- 
date of the party voted for in the list contest has little chance of winning the seat, in 
which case the elector might switch her/his vote to the candidate of anothe *r party. In 
applying this general argument to the New Zealand situation in 1999, the relative sit- 
uation of the various partie s must be taken into account. 

Before the change in the voting system in 1996, two parties—Labour and Na- 
tional—had dominated New Zealand politics for some sixty years. These continued to 
dominate the constituency contests at the first MMP election in 1996, with National 
winning 30 of the seats, and Labour 26: the Alliance, ACT, and United each won one 
seat (in each case their candidate was formerly prominent in one of the two main par- 
ties), and the populist New Zealand First Party won 6 (including the five Maori seats; 
their other was won by a former National Cabinet Minister). Apart from those that 
they won, the four “minor parties” represented in the House did not come very close 
to victory in any others: there was only one seat where ACT had a good chance of vic- 
tory and none for the Christian Coalition; the Alliance had an outside chance (a 1996 
vote of 20% or more) in eight and New Zealand First in six; the Green Party was part 
of the Alliance in 1996. 

As general expectations, therefore, we expected to find: 1) that loyalty to each party 
(i.e., voting a straight ticket for each) would be greater in 1999, the greater the party's 
chances of victory in the constituency contest; and 2) that split-ticket voting from one 
of the “minor parties” to one of the “big two” would be larger, the greater the latter 
party ’s chances of victory in the constituency contest. To test whether this was so, we 
used the relevant party's percentage of the votes cast in 1996 in the constituency as a 
measure of its chances of victory there in 1999.° 

Figures 12 and 13 are the scatter diagrams for straight-ticket (i.e., party loyalty) 
voting for each of the two largest parties. (In all of these scatter diagrams, we have fit- 


ted the lowest moving-average best-fit line, using 33% of the points.) In the case of 


Labour, the relationship is as expected: the better its performance in 1996, the larger 


3. The constituency boundaries were changed between the two elections, and a further two con- 
stituencies created. Estimates of the result in 1996 as if the election then had been fought in the new con- 
stituencies were created by Alan McRobie using polling booth data: we are indebted to him for providing 
us with these data 
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the percentage of its 1999 list voters who also supported the party's candidate in the 
constituency contest then (Figure 12). For the National party, the same relationship 
holds (the two constituencies where it fielded no candidate have been excluded) ex- 
cept that it retained the loyalty of a relatively low percentage of its list voters in most 
of its safest seats, producing an inverted-U relationship—although the downturn on 
the right-hand side of the graph is very much a function of a single observation only 
(Figure 13). 

Turning to the “minor parties,” there were two on the left of the political spectrum 
in 1999—the Alliance (dominated by former members of the Labour Party, who were 
stronger in their “socialist” leanings than those who had remained with Labour) and 
the Green Party (which was part of the Alliance in 1996 and fielded no constituency 
candidates then). Figure 14 shows, not surprisingly, that loyalty to the Alliance in 
1999 was greatest in the only constituency that it would have won in 1996 (recall that 
we are using estimates of the 1996 results because of constituency changes: over 80% 
of those who voted for the Alliance list there also voted for the Alliance constituency 

candidate). Where the Alliance had little chance of victory, however, there was quite 
a wide spread of loyalty figures. The reason for this is shown in Figure 15, which indi- 

cates a clear positive relationship between the number of Alliance list voters who 
switched their constituency support to the Labour candidate and Labour's 1996 per- 
formance: the stronger Labour was in a constituency in 1996—and thus the better its 
chances of victory in 1999—the more support it picked up from those whose list vote 
went to the Alliance. If your party has no chance of success in the constituency, it 
seems, you vote for the party on the left of the political spectrum that does. And ex- 
actly the same applied to the Green Party list voters who switched to Labour in the 
constituency contest (Figure 16). 

Turning to the two smaller parties on the right of the political spectrum—ACT (the 
Association of Consumers and Taxpayers) and the Christian Coalition—Figures 17 
and 18 look at the percentage of their list voters who switched their constituency sup- 


D 
So 807 
> 
2 
32 70 7 
2 
2 
3 60 4 
> 
— 
5 50 4 
40 4 


Labour Constituency Percentage 1996 


Fic. 12. Labour Constituency Vote Percentage in 1996 and Loyalists 1999, by Constituency 
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Fic. 13. National Constituency Vote Percentage in 1996 and Loyalists 1999, by Constituency 
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"IG. 14. Alliance Constituency Vote Percentage in 1996 and Loyalists 1999, by Constituency. 


100 

90 oO 

Oo 

== 

70 

Oo 

60, O 

: 

40 

3010 

0 10 20 30 40 50 60 70 80 e 

100 

80 

60 

40 re) 

0 

0 10 20 30 40 50 f 

e 


Alliance List % Labour Constituency 


Labour Constituency Percentage 1996 


Fic. 15. Labour Constituency Vote Percentage in 1996 and Alliance-Labour Split-ticket Voters 1999, by 
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Fic. 16. Labour Constituency Vote Percentage in 1996 and Green-Labour Split-ticket Voters 1999, by 
Constituency 
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Fic. 17. National Constituency Vote Percentage in 1996 and ACT-National Split-ticket Voters 1999, by 
Constituency 
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Fic. 18. National Constituency Vote Percentage in 1996 and Christian Coalition-National Split-ticket 
Voters 1999, by Constituency. 
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,ort to the right’s major party—National. Again, the patterns are as expected—the 
etter National's performance in 1996, the more who switched to it in 1999, although 
igain National performed relatively poorly in its safest seat where it was subject to a 
strong challenge from the ACT candidate." 

Finally, there is the populist New Zealand First Party. This was initially formed as a 
breakaway party from National in the mid-1990s. After the 1996 election it held the 
balance of power between the two main parties and following prolonged negotiations 
agreed to enter a coalition with National. The coalition broke up in 1998, however, as 
did New Zealand First. In 1999, it won greatest loyalty from those who supported it 
in the list contest where its 1996 performance was best—although the scatter of 
points is fairly wide (Figure 19). Among those who voted a split ticket, however, Fig- 
ures 20 and 21 suggest that most switched their support at the constituency contest to 
the Labour candidate, especially if he/she had a good chance of success: however, 
where the National candidate was well placed to win, there was a greater shift in that 
direction—although as Figure 20 shows, much of the positive relationship results 
from very small split-ticket shifts in that direction in the six constituencies where Na- 
tional performed especially badly in 1996: excluding those there is only a slight posi- 
tive relationship, suggesting that after the 1995 coalition break-up most New Zealand 
First list supporters preferred Labour to National.’ But the better Labour's chances 
of victory in a seat, the larger the percentage of New Zealand First's list voters who 
switched their support to the Labour candidate (Figure 21 
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Fic. 19. New Zealand First Constituency Vote Percentage in 1996 and Loyalists 1999. by Constituency 


4. This was the Auckland constituency of Epsom where the ACT candidate came a close second in 
1999, having achieved only an estimated 3% of the votes in 1996. The ACT candidate in 1999 was an in 
cumbent MP. who had won a list seat in 1996 (see Vowles 2000 
5. These are the six Maori seats, which would have been won by New Zealand First in 1996 but pro- 
duced resounding victories for Labour in 1999 (Figure 20 
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Fic. 20. National Constituency Vote Percentage in 1996 and New Zealand First-National Split-ticket 
Voters 1999. by Constituency 
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Fic. 21. Labour Constituency Vote Percentage in 1996 and New Zealand First-Labour Split-ticket Vot- 
ers 1999, by Constituency 
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Regression Analyses 


These exploratory visual explorations suggest the validity of our hypotheses relat- 
ing to the geography of straight- and split-ticket voting. This section reports on more 
formal tests, using OLS regression—and only fitting linear trends. In these, we test a 
further hypothesis, relating to campaign activity. 

The logic underpinning the tests reported above is that voters act rationally in de- 
ciding whether to support their preferred party in their local constituency contest or 
instead vote a split ticket, switching their constituency support to another party. Some 
may well act in the latter way, having assessed their preferred party's chances of suc- 
cess in the constituency contest. But many will have been unprepared to take the 
time and effort to do that—to collect the relevant information (remember that the 
constituencies had changed since 1996). As theoreticians of democracy have been ar- 
guing since Downs (1957), voting choices have to be based on information, but the 
cost of collecting information may be considerable and many may feel it not worth 
the effort—given pe low probability that how they vote will influence the result of 
the election. They are thus largely dependent on any information provided to them 
free, which takes no time to access and little to assess. 

In constituency-based electoral systems, a major source of such information is the 
political parties and their candidates who are campaigning for votes. Their campaigns 
are designed to provide electors with the needed information, and to encourage them 
to support a particular candidate. A range of studies (including some on New 

Zealand: Johnston and Perry 1954; Johnston and Pattie 1999) have shown that in gen- 
eral the more that a party spends on its constituency campaign (mainly on leaflets and 
posters), re ‘lative to other parties the better its pe ‘rformance there. At the 1999 gen- 
eral election, candidates in the constituency contest were limited to spending no 
more than NZ$20,000 on their campaigns, and we take these figures (provided by the 
Electoral Commission) as a measure of campaign intensity. 


We ran the 


regressions in three stages 


1. Including just the constituency vote percentage in 1996 for the party supported 
in the list contest in 1999 

2. Adding the constituency vote percentage in 1996 for the party supported in the 
constituency contest in 1999: and 

3. Adding the spending by the two parties involved at the 1999 constituency con- 

test 


There is no second stage for the straight-ticket regressions. Our expectations were 
that the coefficients for the list partys 1996 vote would be positive, as would those for 
its own spending; the other two should be negative. For regressions involving the Na- 
tional Party these were run twice: once for all constituencies where it fielded a candi- 
date in 1999 and the other excluding the aberrant case—Epsom—to ensure that this 
was not biasing any observed trend. 

The results in Table 3 provide very strong support for the general model. The av- 
erage rm value across the ten regressions at the third stage was 0.57; replacing the al- 
ternative regressions for National (in italics in Table 3) it is 0.61. In addition, all but 
two of the regression coefficients were significant at the 0.05 level or better (signifi- 
cant coefficients are shown in bold); the exceptions were those for ACT and Christian 
Coalition spending respectively in the regressions relating to splits from those to Na- 
tional. In general terms, therefore: 


The better a party's chances of victory in a constituency, the larger the percent- 
age of its list voters who also support its constituency condidsto—i, e., vote a 
straight ticket for the party; 

This tendency to vote a straight-ticket in constituencies that the party has a 
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TABLE 3 


Regressions of Straight- and Split-ticket Votes on Party Performance in 1996 and Party Campaign Spend- 
ing in 1999 


Vote im b coefhicients (stage 3) for 
Vote on List Constituency 2 Lh LSP CSP 


Labour Labour 0.38 _ 0.43 0.45 _ 0.67 — 
National National 0.43 ~ 0.53 0.34 — 0.73 —_— 
061 0.67 0.47 _ 0.59 
Alliance Alliance 0.22 — 051 0.64 — 1.44 — 
Alliance Labour 0.25 0.56 0.65 ~0.62 052 —0O.88 0.56 
Green Labour — 054 0.64 0.57 0.61 1.23 
ACT National 0.15 0.19 0.32 0.23 -1.26 O17 1.14 
0.08 043 0.54 —1.16 0.49 ~0.33 0.90 
Christian Coaln National 0.49 0.50 0.62 0.58 0.20 —0.13 0.48 
0.00 0.56 0.65 0.59 0.24 —0.09 0.44 
NZ First NZ First 0.36 _ 0.66 0.22 — 2.08 — 
NZ First Labour 0.29 0.60 0.66 -O.15 0AL -—0.70 0.34 
NZ First National 0.36 0.60 0.70 0.09 0.11 -—0.33 0.16 
0.50 061 071 -0.09 012 -—0.36 0.14 


Notes: Key to variables: L66—list party's constituency vote percentage in 1966, O66—constituency partys constituency vote percentage in 
1966, LSP—list party s constituency spending in 1999: CSP— constituency party's constituency spending in 1999. Significant regression coef 
ficient sat the 0.05 leve 1, are shown in bold. The regressions in italics are those involving the National party and omitting the constituency of 
Epsom 


good chance of winning is accentuated in those where it also spends heavily on 

its local campaign—the greater the intensity of that campaign, the greater the 

number of straight-tickets as a percentage of its list supporters; 

3. The better one of the two major parties’ chances of success in the constituency 
contest, the larger the percentage of those who supported another party on its 
side of the political spectrum who switched to voting for it in the constituency 
contest—i.e., split their ticket in favor of the stronger party on that side of the 
spectrum; 

4. This ability to attract split tickets in constituencies where they had good chances 
of success was accentuated in those constituencies where the major parties 

campaigned most intensively; 

5. The success of the large parties’ campaigns in attracting split-ticket votes was 

mitigated to some extent by the intensity of the smaller parties’ local cam- 

paigns—the more that the small parties spent, the smaller the outflow of split- 
ticket votes to the larger party (whatever its spending), although this conclusion 
applied only to the parties of the left (Alliance and Green) 


The geography of split- and straight-ticket voting in New Zealand in 1999 was consis- 
tent witch our hypotheses. 


CONCLUSION 


Most attention with regard to the ecological inference problem has focused on 
what we have termed the empty cell situation (Figure 1), where there is no prior 
knowledge of the internal cell values of any matrix within a data cube. Estimating 
those unknown values is the goal—with special attention focused on the unknown 
values in the global matrix, although the values in the individual area matrices are of 
considerable interest to geographers and others concerned with spatial variations in 
the behavior patterns under consideration. 

There are, however, other ecological inference problems, where the overall focus is 
the same—estimating unknown values in the internal cells of a data cube—but fur- 
ther information is available on which the estimates can be based, which do not exist 
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for the “classic” empty cell situation. One of these is what we have termed the sum- 
of-cells situation (Figure 2), in which a further set of information is available at the 
global scale. This can be used as a third constraint in procedures designed to solve the 
ecological inference problem, and thereby produce estimates of spatial variation in 
behavior which are consistent with the overall pattern in that behavior 

A method for resolving the ecological inference problem in this sum-of-cells situa- 
tion (EMax) has been in use for some time, but it has not been possible to evaluate 
the accuracy of its estimates of the geography of behavior in any major way. This dif- 
ficulty has been overcome in the analyses reported here, using data on straight- and 
split-ticket voting at the 1999 New Zealand general election. The estimates of 
straight-ticket voting produced by EMax, given the availability of data for the three 
sets of constraints, have been compared with the “real” values collated by the Elec- 
toral Commission, and found to be extremely accurate. For this data set at least, we 
have great confidence in the estimated off-diagonal, split-ticket voting estimates, 
given the obvious success of the procedure in estimating the on-diagonal, straight- 
ticket votes. 

The strength of these findings suggests that there are clear benefits to be gained 
from wider application of the EMax methodology, including comparisons of its per- 
formance—where relevant—with that of EI. There is a wide range of potential appli- 
cations for both of these procedures in geography (as Sui et al. 2000, identified for 
El), exploration of which might lead to ways of merging the two. 

The purpose of solving the ecological inference problem in geographical studies is 
to enable studies to be undertaken of spatial variations in the behavior under consid- 
eration, and to test hypotheses regarding the patterning of that variation. That has 
been illustrated here with analyses of the geography of straight- and split-ticket vot- 
ing at New Zealand's 1999 general election. We have been able to show that those 
patterns are consistent with expectations based on rational choice models. People 
who support a party in the list contest are more likely to support its candidate in the 
constituency contest as well if he/she has a good chance of success there—and are 
particularly likely to do so if that candidate campaigns hard for their support. 
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Measuring Impacts of Condition Variables in 
Rule-Based Models of Space-Time Choice Behavior: 
Method and Empirical Illustration 


The use of rule-based systems for modeling space-time choice has gained increasing 
research interests over the last years. The potential advantage of the rule-based ap- 
proach is that it can handle interactions between a large set of predictors. Decision 
tree induction methods are available and have been explored for deriving rules from 
data. However, the complexity of the structures that are generated by such knowledge 
discovery methods hampers an interpretation of the rule-set in behavioral terms with 
as a consequence that the models typically remain a black box. To solve this problem, 
this paper develops a method for measuring the size and direction of the impact of 
condition variables on the choice variable as predicted by the model. The paper illus- 
trates the method based on location and transport-mode choice models that are part 
of Albatross model—an activity-based model of space-time choice. 


INTRODUCTION 


The recent planning literature has witnessed the exploration of decision tree tech- 
niques for analyzing space-time behavior. Decision tree induction methods, which 
have been developed in statistics and artificial intelligence, provide a technique to de- 
rive decision trees from observations on choice. The application of the technique for 
developing predictive models of space-time choice has been explored in several re- 
cent studies (Thill and Wheeler 1999: Gahegan 2000a; Arentze et al. 2000; Wets et al. 
2000; Yamamoto, Kitamura, and Fujii 2002; Moons et al. 2002). In particular, we have 
used the technique to develop the activity-based model Albatross (Arentze and Tim- 
mermans 2000, 2002). Several empirical studies showed that the technique for such 
purposes is promising in terms of internal validity, performance relative to conven- 
tional utility-based models and spatial transferability (Arentze et al. 2001). Gahegan 
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(2000b) stresses the potential of inductive learning as a method, besides scientific 
visualization, to address data with high attribute dimensionality. 

The strength of the decision tree technique for modeling space-time behavior is its 
ability to represent the full complexity of interactions between condition variables. 
Unlike algebraic models (e.g., Bowman and Ben-Akiva, 1999), the specification of the 
model does typically not rely on the commonly used linear-additive utility functions, 
possibly with some selected interaction effects. At the same time, however, this com- 
plexity hinders an interpretation in terms of the impacts of condition variables on the 
choice variable. Whereas it is relatively straightforward to derive elasticities in dis- 
crete choice models, the non-algebraic nature of decision-tree based models prevents 
one from such easy interpretation. 

Measuring the importance of condition variables has received some attention in 
Statistics and Artificial Intelligence mainly for two applications. First, it has been 
considered for identifying the most significant features among a large set of poten- 
tially relevant expl: unatory variables in a pre-modeling stage (e.g., Kira and Rendall 
1992: Kononenko 1994). Feature selection methods that have bee *n proposed take 
into account possible correlations between condition variables and try to measure 
the independent contribution of condition variables to observed differences in 
choice for example by using information theoretical concepts (e.g., Piramuthu 
1996). In a second application, the objective is to rank conditional variables accord- 
ing to the contribution that they make in classifying or predicting the target vari- 
able in a decision tree model (e.g., Steinberg and Colla 1997). Thus, in both 
applications attention is focused on the potential or apparent significance of vari- 
ables for prediction or classification. We emphasize that methods that have 
emerged from this area are not transferable to the present problem. Elasticity 
refers to a different dimension, namely the extent to which pre dictions are sensitive 
to variation over a hypothetical range of the condition variable of interest. To the 
best of our knowledge, methods for deriving such information from rule-based 
models do not exist. 

To fill this gap, the aim of this paper is to introduce a method to derive elasticity in- 
formation from rule-based models. The principle of the proposed method is straight- 
forward. After having derived a rule-based model from training data, the model is 
used to predict a choice frequency distribution multiple times for a same set of cases 
under different simulated settings of one condition variable at a time. Then, elasticity 
or impact is conceptualized as the size of the difference in choice frequency distribu- 
tion between levels of the condition variable. Because a model, rather than data, and 
simulated variation over the full range of a condition variable, rather than existing 
variation, is used the resulting measures can be interpreted as elasticity or impact co- 
efficients. 

The proposed method is illustrated using decision tree models for location and 
transport mode choice that are part of the Albatross model system. The original ver- 
sion of Albatross was derived from a limited study area. The results presented here 
are based on a re-induction of the system conducted to extend the study area to the 
national scale. Given the large scale of the data set used, the elasticities derived from 
the models should provide potentially useful information for time geographical re- 
search. 

To introduce and illustrate the method, the paper is structured as follows. The 
next section first describes the format of rule-based models as assumed in the 
method. Then, the second section introduces the proposed method for extracting 
elasticity coefficients. The section that follows discusses the results of the case 
study to illustrate the method. Finally, the paper concludes with summarizing 
major findings. 
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2. AREPRESENTATION FORMAT OF RULE-BASED MODELS 


The method developed in the next section is suitable for rule-based models based 
on the decision tree formalism, but is not restricted to that formalism. Therefore, this 
section describes requirements for a rule-based formalism in a more generic way. 

The assumed model considers a “a rete set of choice alternatives i and pre ‘dicts a 
choice probability for each alternative i. The model consists of a set of decision rules 
possibly (but not necessarily) peered as a decision tree and predicts a choice prob- 
ability for each given case as: 


ple)=>, Pr(kie) Pri(ijk) Vi (1) 


where k is an index of a decision rule, c is a vector of condition values observed in the 
given case, Pr(k | ¢) is the probability that ¢ meets the condition state of rule k, and 
Pr(i | k) is the probability that rule k generates prediction i. To be consistent and com- 
plete, the rule-set she mild meet the following conditions: 


>, Prikle) = 1 Ve 
Priilk) =1 vk 


In conventional rule-based formalisms, which we consider here, a rule is activated in 
a deterministic way implying that Pr(k | c) is either zero or one in equations (1) and 
2). The condition state of each rule defines a conjunct set of necessary conditions. 
So, the rule selection function can be written as: 


if c, ECSy AcyECSy 


Prikie 


10, otherwise 


where CS, C CD, is the state of the s-th condition variable in the k-th rule, CD, is the 
domain of the s-th condition variable and e = (c,...c,). If the condition variable is 
continuous, a state is generally defined as a certain subrange of CD,. Otherwise, if 
the condition variable is discrete, a distinction is made between ordinal and nominal 
variables. A state of an ordinal variable is restricted to subsets of the domain that pre- 
serve the order of the elements, whereas a state of a nominal variable can take on any 
subset of the domain. Format (4) does not necessarily imply that every condition vari- 
able is relevant in every rule. Irrelevance of a variable is re presente sd by a special de- 
finition of its state. That is, if s is irre 2 ‘vant in the k-th rule we define CS, = CD,. 
This means that every possible value of c, meets the condition so that in fact c, is ir- 
relevant for determining whether the « case matches k. 

Decision tree induction is one of several possible methods for finding structures of 
type (4) that best fit data on observed choice. The term ‘training set’ is generally used 
to refer to the sample of observations used to derive a rule-set. The fit of a decision 
tree is conceptualized as the degree of homogeneity or purity of choice outcomes at 
leaf nodes. Several heuristic methods have been proposed in AI and statistics to find 
the optimal tree. Top-down methods develop a tree by recursively splitting the train- 
ing set on condition variables until a single choice outcome is observed within parti- 
tions or no further improvement is possible. Different criteria of splitting have been 
proposed including a measure of information gain (C4.5, Quinlan 1993), the Gini- 
index (CART, Breiman et al. 1984) and a Chi-square based measure of significance of 
difference between partitions (CHAID, Kass 1980). Other differences between 
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(3) 
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methods exist, but in the context of this study the process by which trees, or rule sets 
in general, are derived is not relevant as long as the format confirms to definitions 
(1)-(4) 

The second term on the right hand side of equation (1) defines the predicted 
choice probability of i for any case matching rule k. In decision tree applications, an 
all-or-nothing action assignment rule is commonly used. The so-called ‘plurality’ rule 
assigns the modal choice of training cases at leaf node k to each unseen case assigned 
to k. Such a rule does maximize expected probability of correct predictions, but fails 
to reproduce remaining variance in choice outcomes, if any, at leaf nodes. In planning 
applications, it is generally important to reproduce une xplaine -d variance in predic- 
tions. To come up to that need, Albatross uses a probabilistic assignment rule instead. 
Hereby, choice probabilities are derived from the training set as: 


Pr (ijk) = 


ny 


where f;, is the frequency of choice i at leaf node k and n, is the total number of cases 
at leaf node k in the training data. However, in the present context, it is not important 
how the action assignment rule is derived as long as constraint (3) is met. 


3. DERIVING IMPACT TABLES 


Given a decision tree of the form as defined by (1)-(4), this section outlines a 
method of deriving impact measures for each condition variable s. We emphasize that 
the purpose of the measures is not to replace the rule-based model, but rather to fa- 
cilitate an interpretation of the model in behavioral terms. The method relies on the 
principle of simulating changes in a representative sample of cases on one condition 
variable at a time and next evaluating the impact on choice predictions. The sample 
should be representative for the population to which the model is applied. In many 
cases, the training set can be used for this. In this section we describe the method in 
more detail. 

Following our earlier notation, the sample (e.g., training data) can be represented as: 


pile ©; = Vj (6) 


where as before s is an index of condition variables and j is an index of cases. The 
method assumes that each condition variable is a discrete variable with levels c, 

1....m(s). This assumption does not exclude continuous variables from the analysis, 
but rather implies that these variables should be discretisized in a pre-processing 
step. For example, one can use an equal-frequency method, which divides a con- 
tinuous scale into m intervals in such a way that each interval contains an equal num- 
ber of cases. If q is the number of choice alternatives, the model is used to predict an 
m(s) X q frequency table for each condition variable s as: 


..q, for u(s) = 1...m(s) , (7) 


where c is a manipulated vector of condition values and p, is the predicted choice 
probability of i using equation (1). The condition vector in (7) is defined as: 


= (Cij Cop ---, U(S), Vj (8) 
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(5) 
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This means that for deriving the impact frequency table a prediction run is repeated 
m(s) times replacing for each case the observed \ alue of the s-th condition variable by 
a varying value u(s) = 1....m(s) and keeping all other variables as observed. 

The fre quency table re sulting from this procedure contains the full information on 
the impact of condition variable s on choice outcomes. We propose to derive the fol- 
lowing impact measures from the table: 


IS. = DiF_.F- 


IS, = DiF,.F., 


where D(e,e) is some measure of difference between frequency tables, F, is the 
m(s) X q frequency table predicted by the model, F, is the « ‘xpected frequency table 
under the assumption of no impact of s, F,, is the i-th column of the fre ‘quency table 
generated by the model, F,, is the expected counterpart under the assumption of no 
impact of s on choice i, and f,,,., is the u(s) X i cell frequency of the table predicted 
by the model. Several goodness-of-fit measures can be specified for D such as for 
example Chi-square, the sum of absolute differences, likelihood and so on. 

As implied by the equations, the first measure gives an indication of the overall im- 
pact of variable s on the choice variable and the second measure provides a disaggre- 
gation to obtain an indication of its impact on each choice alternative i. Finally, the 
third measure is included to obtain information on the functional shape of the mea- 
sured impact. More specifically, the measure can be interpreted as a measure of mo- 
notonicity. To see this, consider the following special cases. If the condition variable 
has a monotonically increasing impact on choice frequency across the levels of the 
condition variable, then MS,. « ~quals 1, because in that case: 


whereas, if the condition variable has a monotonic ‘ally decreasing impact, the ratio 
equals — 1, because in that case: 


(13) 


In all other cases, the ratio has a value in between — 1 and 1. To see how the ratio be- 
haves within this range, it is clarifying to rewrite (12) as: 


i (14) 


This means that MS, is sensitive only to the difference in choice fre ‘quency between 
ys first and last leve J of the condition variable. If the impact function has a perfectly 

|-shape or reversed U-shi ape, MS,, is zero. In all other cases, the measure represe nts 
some degree of negative or positive monotonicity. This has a clear meaning for condi- 


; 
(9) 
(10) 
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tion variables that are measured on an ordinal or higher level or for binary condition 
variables. However, in the case of multi-level nominal variables, the order of levels is 
purely arbitrary so that the meaning of the measure is limited. 

Measures (9), (10), and (11) are complementary and will be used to derive sub- 
stantial information from a rule-based model. Statistical significance tests are consid- 
ered not relevant in this application, as there are no error terms associated with the 
measures. The procedure is rather computationally demanding but still can be com- 
pleted in reasonable time. For a given rule-set, it requires repeating a prediction run 
m(s) times for each condition variable s. 


4. ILLUSTRATION 


This section illustrates the derivation and use of impact tables with some represen- 
tative decision trees concerning location and mode choice that are part of Albatross, a 
rule-based model of travel de mand. originally developed for the Dutch Ministry of 
Transport, which predicts which activities will be conducted where, when, for how 
long, with whom, and the transport mode involved (Arentze and Timmermans 2000, 
2002). It is one of the most complex activity-based models that currently exists (see 
Timmermans, Arentze, and Joh 2002 for a review). It is not the aim of this paper to 
discuss the model in detail, but we do need to provide some background information 
to fully appreciate the context of this illustration. We therefore, first briefly discuss 
the process model in Albatross, to provide a context for discussion of results in the 
sections that follow. 


4.1. The Albatross process model 


Albatross consists of two major components that together define an activity sched- 
ule for each individual and each day. The first component generates an activity skele- 
ton consisting of activities that are supposedly fixed in the short- term, their exact start 
time, and duration. Given the skeleton, the second component then determines the 
part of the schedule related to flexible activities to be conducted that day, their travel 
party, duration, time-of-day, and travel characteristics. Both components make use of 
the same location model component determining the location of activities. All three 
components assume a sequential decision process in which key choices are made and 
pre-defined rules operate to delineate choice sets and implement made choices in the 
current schedule. 

Figures 1-3 schematically present the structure of each of the main components of 
the model. Each numbered rectangle corresponds to a decision tree, which is derived 
from activity diary data. The indices used in the figures are defined as: 


i index of activity in order of priority, i=1...1 
j index of episode of activity i in order of start time, j = 1...J 
k index of tour in order of start time, k = 1...K 


The skeleton component comprises decisions 1-13, the location component used for 
both fixed and flexible activities decisions 14-20 and the flexible activity component 
decisions 21-27. 

Clearly, there are many ways in which the scheduling problem can be broken down 
into key choices and key choices can be arranged into a sequential decision process. 
Two principles have guided the design of the process model. First, qualitative deci- 
sions are made as much as possible explicitly. For that reason, decisions such as the 
selection of an activity, the number of episodes per activity, trip linkages between ac- 
tivity e pisode s, and relative locations of activity episodes are made separately prior to 
determining their exact attributes in terms of start time, duration, location, and 
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mode. In this way, we may expect that distributions of key schedule characteristics 
will be accurate ‘ly predicte sd. Second, the sequencing of ‘identified key choices is 
based on assumptions regarding the priority of decisions. For that reason, the skele- 
ton is determined before flexible parts of the schedule and start time decisions before 
location and mode decisions, ete. 

As an important distinct feature, the skeleton model component determines activ- 
ity patterns on a continuous time scale (Figures 1 and 2). It consists of several sub- 
processes including: 


Determining the pattern of sleep activities (1, 2) 

Determining the pattern of the primary work/school activity (3-8). 
Determining the pattern of secondary, fixed activities (9-1: 3). 
Determining the location of each fixed activity episode (14-20). 


The component concerned with flexible activities (Figure 3) consists of the following 
subprocesses: 


1. Choice of transport mode for the primary work/school tour(s) (21) 

2. Determining selection, travel party and duration of flexible activities (4-24) 
3. Determining start time and trip chaining of activities (25-26) 

4. Determining the transport mode of each tour in the schedule (27) 

5. Determining the location of each flexible activity (14-20) 


The same location model is used for flexible and fixed activities. Possible differences 
in decision rules can be captured by decision trees, as activity type is incorporated as 
one of the potential condition variables. In the remainder of this section we describe 
the location model in more detail given its relevance for the illustration. 

The location component chooses locations in descending order of priority of fixed 
activities. For each activity, the choice set consists of all postcode areas' (PCAs) in the 
Netherlands. First, the model chooses the municipality and next a PCA within the 
chosen municipality. At both levels, the model determines a choice by increasingly 
narrowing down the choice set in a number of steps. For the choice of municipality at 
the highest level, the first decision tree determines whether the activity (episode) is 
conducted within or outside the home municipality of the individual. If the last op- 
tion is chosen, the choice of a municipality follows from a choice of an order and dis- 
tance band. Five orders are distinguished based on population size. First-order 
municipalities are municipalities with an above regional function, second-order mu- 
nicipalities have a regional function, third-order municipalities have a local core func- 
tion, and fourth and fifth-order municipalities correspond to smaller towns. Given the 
order, the choice of a distance band follows. The combination of order and distance 
band tends to reduce choice sets strongly. If there are still multiple alternatives left, 
the model selects a municipality semi- randomly using a parameterized probability 
function in which less distant locations within the given distance band have a higher 
probability of being selected. 

For the choice of a zone (i.e., a PCA) within the chosen municipality a similar logic 
is used. Again, locations—in this case PCAs—are classified into four order categories. 
This time, orders are defined in terms of relative size of employment within the cho- 
sen municipality. Taking the relative size implies that PCAs of the same order may 
differ in employment size between municipalities. Given the chosen criterion, first- 
order PCAs typically belong to the city center, second-order to a band around the 


1. In the model, postcode areas are defined at the level of the first 4 digits. To give an indication of spa- 
tial resolution: The Netherlands counts a total of over 4000 4PCAs. Thus, given a population of roughly 16 
million people, the average number of individuals per 4PCA is approximately 4000 people and the average 
size is approximatelh ly $80 ha 
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center, and last-order locations to the pe riphery of the municipality (cores around a 
city or the edge of cities). The choice of a PCA then is the result of a choice of order 
and next distance band in the same way as before. Again, if there are multiple choice 
alternatives left a semi-random selection is made using a similar probability function 
as in the case of municipalities with parameters specifically estimated for PCA choice. 

In sum, the model takes into account that location choice has a random component 
and, at the same time, correlates with order and distance. By using this approach, the 
observed distribution of locations across combinations of size and distance category 
should be accurately predicted. The structure of each decision tree separately should 
take care of reproducing correlation structures between independent variables within 
this distribution. 


4.2. Data 


The empirical derivation of the model is based on four pooled activity diary data 
sets that were collected on different moments in the period of 1997-2000 in different 
regions of the Netherlands. Although the data sets were collected for different re- 
search projects, the survey methods and diary instruments used were largely the 
same. In each survey, all household members older than 12 years were invited to fill 
out a diary for two consecutive days. Diaries were balanced across days of the week. 
Only the diaries of adult members (i.e., household heads) of households were used to 
derive the decision trees. As a result, the training set used consists of a total of 9985 
person-days. The data set covers a wide range of variation within the Netherlands in 
terms of urban character and region, although areas of lower urban density and areas 
in the periphery of the country are less well represented. 

A complementary data set described the nation-wide study area. It included road 
network data, sectorial employment data, population data, and parking data. Network 
data were used to calculate mode specific fastest route distances and travel times be- 
tween PCAs. Public-transport system data were only available at a higher spatial ag- 
gregation level. In sum, the physical data allowed us to measure conditions in terms 
of travel times by time-of-day and mode, travel costs per mode, parking space and 
costs, and availability and size of facilities by activity type. The impact tables de- 
scribed in the section that follows are based on the training data. 


4.3. Illustration of impact measures 


To illustrate the impact measures, we consider here location and mode-choice de- 
cision trees as examples. The location model predicts for each out-of-home activity 
the location where the activity is to be conducted. As implied by the process model, 
this involves seven decision trees (see Figure 2). Arbitrarily, we focus here on the 
choice between home municipality and other and, for cases where ‘other’ is chosen, 
the decision whether or not the nearest municipality of the chosen order is chosen. 
These two decisions largely determine the choice of making a short, local trip or a 
longer, above-local trip. Transport mode decisions involve two decision trees, one for 
the choice of a mode for each work/school trip and one for the choice of a mode for 
each tour for non-work/school activities. 

4.3.1. Condition variables and model performance. Tables 1—5 show the condition 

variables used in deriving the decision trees. The choice of condition variables re- 
fects the availability of physical data, the stage of deve ‘lopme nt of the schedule in the 
concerned step in the decision process, and theories of which variables may moderate 
decisions for the concerned facet. The first table re presents household and individual 
attributes that were used for all decision trees. The second table represents the con- 
dition variables that were used for the location decision trees considered here. The 
last three tables show the general set and spe cific sets used for the transport-mode 
decision trees. In addition to socio-economic variables, the tables gene rally include 
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TABLE | 


Socio-economic and Situational Attributes Used in all Decision Trees 


Label Definition Lavels 


Urb Urban density 0: highest density, 4: lowest density 

Day Day of the week 0: Monday, 6: Sunday 

Comp H composition 0: single 0 worker, 1: single | worker, 2: double 1 worker, 3 
double 2 workers, 4: double 0 workers 

Child Children category 0: no children, 1: younger 6, 2: 6-12, 3: older 12 

Age Age categor 0: < 25, 1: 25-44, 3-45-64, 4: > 64 

SEC Socio-economic class 0: low (below modal), 3: high (two times modal or more) 

Near # of cars in household 0: no cars, 1: | car, 2: 2 or more cars 

Driver I has driving licens« 0: is not a driver, 1: is driver 

Gend Gender of I 0: female, 1: male 

Wstat Work status of J 0: no work, 1: < 32 hours week, 2: 32 hours or more week 

Pwstat Work status of P 0: no work, | 32 hours week, 2: 32 hours or more week 

Nore: H oncerned household: concemed individual. P partner of « moemed individual 


TABLE 2 
Condition Variables for Choice of Municipality (DT 15, 17 


Definition 


Categones 


Adur Duration (min.) of activity under concern 0: <0, 5}, 1: <5,15], 2: <15,60] 
}:< 60,209), 4: <209.459), 5: > 459 

Act Type of activity under concern 0: work/school (long), 1: work/school (short 
2: bring/get, 3: vol. work, 4: other fix, 5: day 
shop., 6: service, 7: n.-day shop., 8: social, 
9: leisure 

Wadu Total duration of work/school activity 0: <0, 385], 1: <385,490], 2: <490,525), 

}:< 525,560). 4: > 560 


OrdeO Order of home municipality 0: lowest 4: highest 
Ordel- Order of chosen municipality 0: lowest 4: highest 
Orderbi Maximum order across munic ipalities 0: lowest 4: highest 


available in distance band i 


Nearbi Nearest band for municipality of order 5-i 0: first band 4: fifth band 

Avbi Chosen order is available in band i 0: no, 1: ves 

WvailT Width (min.) of time window for activity 0: <0, 130), 1: <130,270}, 2: <270,415}, 

}:< 415,654], 4: > 654 

Aflex Activity under concern is flexible type 0: no, 1: ves 

Avgem Another municipality is available in time 0: no, 1: ves 
window for activity 

TRpucai- Travel time ratio (%) public-transport/car to 0: <0, 171], 1: <171,201], 2: <201,232], 
nearest mun. of order i = 3.5 3:< 232,249], 4: > 249 

CRpucai- Travel cost ratio (%) public-transport/car 0: <0, 110], 1: <110,150], 2: <150,175], 
to nearest mun. of order i = 3..5 3:< 175,228], 4:<228,400], 5: > 400 

Cparki Average hour tariff of long parking of 0: <0,0], 1: <0,19], 2: <19,32], 3:<32,92], 
nearest mun. of order i = 3.5 4:<92,275], 5: > 275 

Availi- Municipality of chosen order is available in 0: no, 1: ves 


band i given time window 


Nores: ‘Included only in DT 15: “Included only in DT 17 


~ 
Label 
4 


TABLE 3 
Condition Variables Generally Used for Transport Mode Choice (DT 21, 27 


Label Definition Categones 
TRpubi Travel time ratio Public, Bike to loc 0: <0, 1.0], 1 1.0.1.2], 2: <1.2,1.5}, 
3: <1.5,1.84], 4:<1.84,2.48], 5: > 2.48 
TReoff Car travel time ratio in-peak, out-peak to loc 0: <0, 1.0], 1: <1.0,1.53], 2: <1.53,2.15} 
3: <2.15,2.87], 4:<2.87,3.87], 5: > 3.87 
TRpuca Travel time ratio Public, Car to loc 0: <0, 0.73}, 1: <0.73,1.0], 2: <1.0,1.56] 
3: <1.56,2.64], 5: > 4.21 
CRpuca Cost ratio Public, Car to loc 0: <O, 1.1], 1: <1.1,1.5}, 2: <1.5,1.75}, 
3: <1.75,2.28], 4:<2.28,4.0], 5: > 4.0 
: TRvona Ratio access + egress time, total Public 0: <0, 18}, 1 18,25], 2: <25,32 
; travel time (% 3: <32,40), 4:<40,46), 5: > 46 
. PRbeta Ratio of paid parking places % 0: <0, 9], 1: <9,17], 2: <17,23], 3: <23,62] 
4: > 
Cpark Average parking price per hour (fl cent 0: <0, 0], 1: <0,19], 2: <19,32], 3: <32,92] 
4: > 92 
TABLE 4 
Condition Variables for Transport Mode Choice for Work/School Trips (DT 21 
Label Definition Categones 
Wdu Total duration of work/school activity 0: <0, 385], 1: <385.490], 2: <490.5: 
in S (min 3:< 525,560), 4: > 560 
Wdautot Total duration of work/school and 0: <0, 240), 1: <240,360), 2 <360,450 |, 
4 voluntary work activity in S (min 3: > 480 
Nsec # Of out-of-home non-work activities in S$ 0: 0, 1: i, 2: 2, 3: [3,4], 4: 5,5: > 5 
Byet Bring/get activity included in S 0: no, 1: ves 
Pwdu Total duration of work/school activity 0: <0, 385}, 1: <385,490], 2: <490,525], 
in S of P (min 3:< 525,560), 4: > 560 
Pwdutot Total duration of work/school and 0: <0, 240), 1: <240,360), 2: <360,480}, 
vol. work activity in S of P (min 3: > 480 
Pnsec # Of out-of-home non-work activities in Sof P 0, 1: 1, 2: 2, 3: [3,4], 4:5, 5: >5 
Pbget Bring/get activity included in S of P 0: no, 1: ves 
inAM Travel to work/school in morning peak 0: no, |: ves 
inPM Travel back home in evening peak 0: no, 1: ves 
WduT Duration of work/school activity in T (min 0: <0, 300), 1: <300,443]}, 2: <443,495] 
3:< 495,525), 4:<525,555]. 5 555 
Cadist Car distance to location (km 0: <0, 3.1], 1: <3.1,6.4], 2: <6.4,12.7] 
3:<12.7,22.0], 4:<22.0,45.8], 5: > 45.8 
Treon There is a train connection to loc 0: no, 1: yes 
Nwloc # Of different work locations on T 0:1,1:>1 
vee, Avo Activity before work/school 0: none, |: bring/get, 3: other 
Ana Activity after work/school 0: none, |: bring/get, 3: other 
Pwo P has work overlapping in time with T 0: no, |: ves 
Phget P has bring/get overlapping in time with T 0: no, 1: ves 
Pnact # Of fixed activities of P during T 0:0,1:1,2:>1 
, Pdist Longest car distance across activities of P 0: <0, 0}, 1: <0,2.71], 2: <2.71,5.76], 
overlapping in time with T (km 3: <5.76,14.18], 4:<14.18,34.95] 
5: > 34.95 
Inuse Car in use by P for work trip by 0: no, |: yes 
prev dec mon 
Noves: S ts current schedule. P is partner of the individual: T is the work-based tour unc + concern 
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TABLE 5 


Condition Variables for Transport Mode Choice for Non-work Tours (DT 27) 


Label Definition Categories 


Avear Availability of a car in time window 0: no, 1: yes, 2: unknown 
Nsec # Of out-of-home non-work activities in S$ 0: 0, 1: 1, 2: 2, 3: [3,4], 4:5, 5: > 5 
Wdu Total duration of work/school activity 0: <0, 385], 1: <385,490], 2: <490,525], 
in S (min.) 3:<525,560), 4: > 560 
Wdutot Total duration of work/school and 0: <0, 240), 1: <240,360), 2: <360,480], 
voluntary work activity in S (min.) 3: > 480 
BTchain Episode of the day start time of T 0: 1-st episode .... 5: last episode 
Atyl Type of first activity in T 0: work/school (short), 1: bring/get, 2: vol. 
work, 3: other fix, 4: day shop., 5: service, 
6: n.-day shop., 7: social, 8: leisure 
Aty2 Type of second activity in T 0: work/school (short), 1: bring/get, 2: other 
fix, 3: {day shop., service, n.-day} shop., 
4: |social, leisure} 
Aduri Duration class of first activity in T : short, 1: medium, 2: long 
‘brget There is a bring/get activity in T no, 1: yes 
‘groc There is a daily shop activity in T no, 
lserv There is a service activity in T : no, 


‘soco There is a social activity in T : no, |: ye 
‘leiso There is a leisure activity in T - no, | 

] 


‘nlout There is a work or vol. work act. in T no, 


( 
( 
( 
Cshop There is a non-daily shop activity in T : no, 
( 
( 
( 
( 


‘adist Car distance of longest link in T (km) : <O, O], 1: <0,2], 2: <2,4], 3: <4,6], 
<6,10), 5:< 10,30), 6:<30,80], 7: > 80 
Pbrget P has bring/get activity overlapping in : no, 1: yes 
time with T 
Pserv P has service activity overlapping in time : no, 1: yes 
with T 
Pstat Status of T relative to partner 0: unknown, 1: no overlap, 2: overlap no car, 
overlap car trip 
Pdist Distance (km) of tour of P overlapping 0: unknown, 1: no such tour, 2: 0, 
with T 3: [1, 10>, 4: [10, 45>, 5: = 45 


accessibility indicators, travel costs variables and potentially relevant characteristics 
of the current schedules of the individual and partner, if any. In particular, the relative 
attractiveness of the transport modes is indicated by travel-time and travel-costs ra- 
tios between modes and availability of parking facilities (Table 3). All continuous vari- 
ables were discretisized using the equal-frequency interval method. 

A CHAID-based decision tree induction method was used to derive the decision 
trees, whereby the entire sample was used for training. Table 6 shows some statistics 
of the decision trees that are considered here. Successively the columns represent the 
minimum number of cases per leaf node (nmin), which is a parameter of the induc- 
tion method, the number of condition variables (ncond), the number of choice alter- 
natives (nobs), the number of leaf nodes of the resulting tree (nleafs) and several 
measures of goodness-of-fit. The first two fit measures express an expected ratio of 
correct predictions under a probabilistic action-assignment rule of type (5) for a null 
model and the derived model respectively. Hereby, the null model is defined as the 
tree consisting of the root node only. So, this model only uses the overall sample 
choice frequency distribution for prediction. ¢,,,, Is a combination of the first two 
measures and indicates the increase in fit on a zero-one scale. 

As Table 6 indicates, the decision trees all perform satisfactory in terms of predic- 
tion accuracy (e) and yield a substantial improvement in fit compared to the null 
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model (e,,..). Furthermore, the number of leaf nodes gives an indication of the size of 
the trees. The largest tree has 105 leaf nodes and the smallest tree still has 51 leaf 
nodes. In general, tree size tends to be correlated with the number of cases available 


for training, on the one hand, and the minimum group size setting (nmin), on the 
other. Sizes as we find here are not exceptional and trees would even have been big- 


ger if survey data of a national-scale had been used. This clearly demonstrates the 
problem of interpreting decision trees in terms of impacts of condition variables on 
choice 

4.3.2. Location choice. Tables 7 and § display the impact tables for the location de- 
cision trees considered here. In these tables as well as in the impact tables considered 
in the next section, we used a Chi-square measure of fit as a specification of D in 
equations (9 and (10). The rank order of variables on this measure can be interpreted as 
a rank order of impact on the choice variable overall (the first 1S column) or per level 
of the choice variable (the next 1S columns). Moreover, the size of the difference be- 


TABLE 6 


Some Statistics of the Decision Trees Used as F xampl N 


DT id DT 


L. mun. in/out 
L. mun. nearest 
Mode to work 
Mode to non-work 


TABLE 7 
Impact Table: Choice Between Home Munic ipality and Other (1 


Urb 

Comp 

Chik 

Day 

Age 

SEC 

Near 

Gend 

Driver 

Wstat 27.93 11.74 
Pwstat 0.07 
Adur 21.2 51.12 
Act IS911.43 7676.01 
Wadu 5.00 

4857.42 

Orde! 

Orderb 

Orderb2 

Orderb3 

Orderb4 

Orde rb5 

Ne irb 

Ne arb2 

Nearb3 

Ne arb4 

Nearb5 

Avail 

Aflex 


Avgem 


abel ' ' alt bis healt ‘ ‘ ‘ 
15 90 29 2 IST5S 105 0.512 0.625 0.277 
ay 17 79 8 2 7932 55 0.503 0.727 0.451 
21 6 3665 51 0.351 0.590 0.338 
27 90 8 ' 9523 56 0.425 0.607 0.317 
15 
Condition Is hone 1Sioth MSi home MStother 
6.76 0.4 0.54 
S.1S O17 O17 
19.26 0.04 0.04 
7.42 0.64 0.64 
3 0.08 0.04 0.04 
6.06 0.44 0.44 
“yy 971 0.56 0.56 
35.73 1.00 1.00 
1.43 1.00 1.00 
16.20 O.SS 
0.10 1.00 1.00 
70.13 1.00 1.00 
11235.31 O.1S 
: 2.86 1.00 1.00 
. 2992.63 0.92 0.92 
0 
0 
0.07 1.00 1.00 
0.60 1.00 1.00 
0 
0 
0 
oh 26.83 1.00 1.00 
0.86 1.00 —1.00 
0 
0 
60.08 0.70 0.70 
t 0.97 1.00 1.00 
12.97 5.58 7.39 1.00 1.00 
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TABLE 
Impact Table: Choice Between Nearest Municipality and Other (DT 17 


Condition near MS 


other 


Urb 0 22 

Comp 0 0 0 

Child 5.29 2.83 2.46 1.00 ~ 1.00 
Day §.20 4.4] 3.79 - 1.00 1.00 
Age 0.29 0.16 0.13 0.33 —0.33 
SEC 32.89 17.42 15.46 L.00 1.00 
Near 1.59 O.S5 0.73 1.00 —1.00 
Gend 4.32 2.33 1.99 —1.00 1.00 
Driver 0 0 0 

Wstat O58 031 0.27 —1.00 1.00 
Pwstat ().22 0.12 0.10 —1.00 1.00 
Adur 19.93 10.75 9.18 —1.00 1.00 
Act 0 0 0 

Wdu 0.38 0.21 O17 -~1.00 1.00 
OrdeO 99.93 56.85 $3.08 ~().52 052 
Orde! 7224.67 4737.97 2486.66 0.74 —0.74 
Orderb! 653.28 $12.79 240.48 0.15 -0.15 
Orderb2 Y9S 5.39 158 1.00 —1.00 
Orderb3 1.74 0.94 0.80 0.04 —0.04 
Orderb4 2.21 1.21 1.01 1.00 —1.00 
Orderb5 0 0 0 

94.68 53.91 40.77 1.00 —1.00 
Avb2 0 0 0 

Avb3 0.77 0.42 0.36 1.00 1.00 
Avb4 0 0 0 

Avb5 0 0 0 

AvailT 1.01 0.55 OAT -1.00 1.00 
Aflex 1.44 O.77 0.66 1.00 1.00 
avgem 0 0 0 

TRpuca3 0 0 0 

TRpucat 15.60 8.49 7.11 — 1.00 1.00 
TRpuc ad 189 2 66 2.24 0.01 —0.01 
CRpuca3 25.48 13.77 11.70 0.08 —0.08 
CRpuca4 580.03 305.22 274.80 0.78 —0.78 
CRpuci 0 0 0 

Cpark3 0.00 0.00 0.00 1.00 —1.00 
Cpark4 0 0 0 

Cpark5 0 0 0 


tween condition variables on this measure can be interpreted as the size of the differ- 
ence in impact. One should keep in mind however that the difference in Chi-square 
represents a squared difference in relative impacts. Finally, the MS columns give the 
degree of monotonicity of the impacts per level of the choice variable. 

First, we consider the choice between conducting the activity under concern 
within (home) or outside (other) the own municipality of the individual (Table 7). In 
this case, the choice variable is a binary variable, so ‘that the measures are strongly 
correlated between choice alternatives on the IS as well as the MS dimension. The 
reason for this is that in each case the probability of the first alternative equals one 
minus the probability of the second alternative. For that reason the correlation be- 
tween MS, and MS, is even perfect: for each condition variable the measures have 
the same absolute value and opposite signs. Also in the case of IS, one may expect a 
perfect correlation since an increase in frequency of the first alternative equals the 
decrease in frequency of the second alternative. However, the Chi-square measure 
used here does not consider absolute differences but differences relative to an ex- 
pected frequency of the choice alternative under concern. It is for bees reason that 
measured impacts for Home, of which the base freque ney is 57.7 %, is consistently 
lower than for Other, of which the base frequency is 42. 3 %. vat ause we consider 
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this property of Chi-square as appropriate, 1S, or IS, is redundant in the impact 
table only in the sense that either one of the two can be found by the relationship 
IS = IS, + ISp. 

Now feshing at the contents of the table, we see that activity type (Act, a ten-level 
variable) is by far the most important variable for this choice. The monotonocity 
index MS in this case is close to zero (+0.18) as could be « xpected given the fact that 
activity type is a nominal variable and there is no information in the order of the lev- 
els of the variable. The second most important variable is order of the home munici- 
pality (Orde0). This variable is measured on an ordinal scale whereby a higher value 
indicates a higher order. The high monotonicity index of 0.92 for Home indicates that 
choice of Home almost monotonically increases with order, which is what one could 
expect. The third most influential variable, i.e. the width of the available time window 
for the activity (availT), has an impact of a lower order of magnitude. Here the mo- 
notonocity index for Home is also positive implying that the choice of Home increases 
with increasing time availability. An explanation of this counter-intuitive relationship 
can be found by examining the unde rlying decision tree. As it appears, the direction 
of the impact varies depending on the specific branch of the tree (i.e., the context) in 
which the condition variable plays a role. A negative impact seems to be related to op- 
portunities of linking the activity to another higher-priority activity, especially work, 
that takes place outside the home municipality and that, at the same time, tends to re- 
duce the available time window for the activity. So, the negative relationship may be 
explained by a wish to combine the activity with another activity at a distant location. 
Activity duration is the fourth most important variable with a perfectly monotonously 
decreasing effect on Home choice, as one would expect. 

As for the socio-demographic variables, a number of observations are relevant. 
First, all variables have some impact. In order of decreasing importance we find gen- 
der, children, work status, day of the week, household composition (Comp), income 
(SEC), driver, work status (Wstat), number of cars (Near) and driving license (Dri- 
ver). For binary variables (Gender and Driver) and ordinal variables (Day, SEC, 
Near) the monotonocity index has a meaning and the outcomes are well inter- 
pretable. For example, being male decreases the probability of choosing Home, as do 
higher income and later days in the week. 

Finally, condition variables of the spatial structure of activity locations also have an 
impact We alre ady mentioned the order of the home municipality, which is by far the 
most important one of this category. Urban density of the home location has a nega- 
tive impact. This is what one would e xpect given the fact that the variable is coded 
such that a lower value indicates higher density. The accessibility variables (Orderb1- 
5 and Nearb|-5) tend to have very small impacts. Only the nearest band for an order 
2 municipality has a meaningful impact on Home choice in the anticipated direction. 

The second example (Table 8) concerns the choice of a municipality in cases where 
Other was chosen in the previous step. The decision occurs after the choice of a mu- 
nicipality order and determines whether or not the nearest municipality of that order 
is chosen. A first observation is that overall the impacts are slightly lower than in the 
previous case. The overall size of impacts is determined by the combination of two 
properties of the decision tree, namely the size of the sample and the increase in 
goodness-of-fit achieved by the tree (e,,,.,.). Chi-square values increase with both sam- 
ple size and the fit measure. In this case, the forces work in opposite direction, as the 
decision tree has a lower number of training cases (7932 vs. 18758) and a higher fit 
(0.560 vs. 0.468). As it appears, the effect of sample size is bigger than the effect of fit, 
in this case 

Turning to the contents of Table 8, the chosen order for the municipality (Orde1) 
is by far the most important variable for the decision. The choice of nearest increases 
almost monotonically (MS = 0.74) with increasing order, which is what one would ex- 
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pect. The second most important variable is the order of the nearest band (Orderb1). 
There is a tendency (MS = 0.15) of increasing probability of Near with increasing lev- 
els on this variable. Intere stingly, the next most influential variable is the cost ratio 
between public transport and car for reaching the nearest municipality of the second 
highest order. The higher this ratio the higher the predicted probability of Near 
(MS = 0.78). This suggests that the relative costs of public transport acts as a proxy 
for accessibility of high order municipalities. In addition, the availability of a munici- 
pality of the chosen order in the nearest distance band (avb1) has an impact with an- 
ticipated sign (MS = 1). 

With respect to the relative impacts of different types of condition variables, sev- 
eral observations are relevant. First, socio-economic variables have relatively small 
impacts in this stage of the decision process. Income level (SEC) stands out as the 
socio-economic variable with the biggest impact. The choice of Near decreases mo- 
notonically (MS = —1) with increasing income level. The same holds for having chil- 
dren (Child) and being male, be it that the impacts of these variables are smaller. 
Second, the relative attractiveness of transport mode measured by costs ratios and 
travel time ratios have impact. We already mentioned the cost ratio between public 
transport and car, which has the biggest impact. In addition, the relative travel time 
by public transport to the nearest fourth-order municipality has a monotonically de- 
creasing impact on Near. Finally it is worth noting that parking prices do not affect 
predictions at this level, except that the choice of nearest municipality decreases with 
increasing parking tariff in nearest order of 3. 

4.3.3. Mode choice. In Albatross, mode choice is made separately for primary 
work/school trips and tours for other activities. In both cases, the mode alternatives 
considered include slow (non-motorized), car driver, public transport, and car pas- 
senger. Impact tables related to these decision trees are shown in Tables 9 and 10 and 
can be interpreted as follows. 

First, as it appears from Table 9, distance to the work/school location is by far the 
most influential variable with expecting monotonicity levels for the different mode al- 
ternatives. Car passenger (Pass) undergoes a less monotonous impact across levels of 
this variable. The next most important variables are number of cars in the household 
(Near), ratio paid/free parking places (PRbeta) and having a driving license (Driver) 
with high levels and expected directions of monotonicity. All these variables monoto- 
nically increase the choice of car driver and decrease the choice of public transport in 
the model. 

When we look at the differential impacts of types of condition variable, we see that 
socio-economic variables have only a limited impact here, whereas attributes of the 
transportation system have a relative big impact. Especially, travel time ratios and 
parking tariffs for the chosen destination emerge with substantial impacts. Interest- 
ingly, the capacity ratio of paid/free parking places (Pbeta) is the most important vari- 
able in this category and monotonically decreases the choice of car and increases the 
choice of public transport. Height of parking tariff (Cpark) has a similar but smaller 
impact. Furthermore, the access/egress time ratio in public transport services 
(TRvona) tends to decrease the choice of public transport, but the impact is small. Fi- 
nally, it is worth noting that congestion on routes to the work location is influential. 
The travel time ratio between congested and free floating conditions (TReoff ) both 
decrease the choice of car and increase the choice of public transport monotonically. 

Second, also for tours for other purposes than work/school (Table 10) distance 
(Dist) is the primary moderator of mode choice followed (at a distance) by number of 

cars in the household (Near). As for socio-economic variables, Gender (Gend) 
emerges as the strongest moderating factor in the model. Being male increases the 
choice of car and decreases the choice of a slow mode. The opposite impacts of work 
status of the individual and that of his/her partner suggest an impact of traditional 
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role patterns within households. Of the cost and travel time variables the ratio 
paid/tree parking (Pbeta) is again the most important variable with perfect monoto- 
nous impacts in expected directions. Height of parking tariffs (Cpark) is the second 
most important with impacts in the same directions. Access/egress time ratio of total 
public transport time (TRvona) works in favor of slow mode and reduces public trans- 
port choice. Finally, it is worth noting that the activity pattern of the partner (if any) 
has some impact here. In cases where the partner makes a tour overlapping in time 
(Pstat) the choice of car decreases and the choice of a slow mode increases. 


5. CONCLUSIONS AND DISCUSSION 


Tree induction methods have been developed to explore the complexity in data of 
observations. In the context of space-time data, these methods have been applied to 
extract decision rules from activity-travel diaries either as an explorative tool or as a 
means of calibrating a rule-based computational process model. Although these mod- 
els claim to provide a more detailed insight into context-dependent spatial choice be- 
havior, these rule-based models may be criticized as being too much of a black box. 
Unlike algebraic models such as the gravity model and the competing destinations 
models, rule-based models do not give elasticities, and hence are difficult to interpret. 

The present paper presented a possible solution to this problem. The concept of 
impact tables has been introduced and a simple method of deriving such tables has 
been developed and illustrated in the context of the Albatross model system. The 
measures that were introduced indicate for each condition variable and each decision 
tree or rule-set in general the sensitivity of the predicted choice for levels of the con- 
dition variable. In addition, an index of monotonicity was developed to obtain infor- 
mation on the direction of the impacts. As illustrated by location and mode choice 
decision trees derived for the Albatross model, the imp: act table provides substantial 
information for theory development and policy making. The impact table allows one 
to rank condition variables and assess the difference in impact between condition 
variables, so that the most relevant variables for policy making can be identified. For 
example, the results of the case study suggests that parking price policies and trans- 
port-system service level policies are effective moderators of transport mode choice, 
whereas the spatial structure of locations for activities is more influential for the 
choice of activity locations. Such observations are particularly meaningful in the con- 
text of an activity-based model, which takes into account a large set of activity sched- 
ule and household context variables that may constrain and moderate location or 
mode hoices 

Besides advantages we should point to a potential limitation of the approach. It has 
been found that the structure of empirically derived decision trees tends to be sensi- 
tive to the selection and way of operationalizing of variables in applications where the 
variables are strongly intercorrelated (Quinlan 1993). Depending on such choices of 
the analyst, certain condition variables may or may not be selected by the induction 
method in developing the tree. As a consequence, one could obtain varying values of 
impacts of condition variables. This observation does not degrade the method devel- 
oped here, as it explicitly intends to measure the sensitiveness of a given model and 
not the generalizability of its predictions. However, making elasticities explicit makes 
one more aware of the problem. This problem is not unique for rule-based models, 
but is closely related to the problem of multi-collinearity in estimating parametric 
choice models. Furthermore, instability of decision tree structures tends to be lim- 
ited to those condition variables that make only a small unique contribution to pre- 
diction accuracy. This means that big elasticities tend to be more stable and hence 
better generalizable across model specifications. Using an appropriate feature selec- 
tion method could increase stability of rule-based models. 
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Urban Land Use For Transport Systems 
and City Shapes 


Urban land use is contingent upon the transportation system which is in turn depen- 
dent on business development in the city. The urban economics literature influenced 
by the Solow- a y model has extensively studied the above relationship. In the 
present paper we re-examine the Solow-Vickrey model of the long narrow city by 
defining a new general mathematical formulation in which the urban shape is a fune- 
tion of the allocation of land for transport. We focus in particular on emergence con- 
ditions for different types of polye entric cities. The methodolo; gy discussed in the 
paper provides a new analytical framework for assessing the impact of urban trans- 
port flows on city growth 


INTRODUCTION 


Structural and qualitative transformations in cities are often due to extensions of 


and new developments in transport infrastructure, in particular roads (Wang 2000; 
Martinez and Araya 2000; Peeters, Thisse, and Thomas 1995; Wheaton 1998; 
Cervero and Wu 1997: Berry and Kim 1993: Ladd and Wheaton 1991). The rise in 
road traffic and the subsequent increase in land use devoted to road infrastructure 


has produced many externalities that directly and indirectly affect the morphology of 


our cities. 

The increase in urban land earmarked for transport systems is often initiated by 
the need to alleviate the congestion problem in urban areas. Travel time, and subse- 
quently travel cost, increases when the movement from an origin to a destination is 
influenced by other travellers using the same infrastructure. More precisely, conges- 
tion occurs when the capacity of the transport infrastructure is insufficient to meet si- 
multaneously all demands of travellers, thus causing an increase in travel time. 
Congestion is therefore closely related to the characteristics of transport infrastruc- 
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ture (such as the capacity of the road) and to the characteristics of traffic flows (such 
as the composition of the traffic flows), and it is therefore indirectly but certainly re- 
lated to the formation of the city’s specific shape. 

From the point of view of congestion, both the density and the volume of traffic are 
two fundamental traffic parameters that are able to yield better insight into urban 
land-use. In urban economics literature, a standard reference is Solow and Vickrey 

(1971), who assume in their model that the business districts in a city are uniformly 
distributed in the entire urban area and that each business district is an origin and a 
destination of traffic flows. The objective of their theoretical model is to examine the 
optimal allocation of urban land for transportation. Volume and density of traffic 
have, therefore, a prominent place in their analysis of urban form. 

The aim of the present paper is to re-examine the model of Solow and Vickrey (1971) 
and to extend and generalise the spatial scope of the model. In particular, our atten- 
tion will focus on how, in mathematical terms, transport may shape cities and thus on 
how the minimization of transport costs and the emerging new city shape may deter- 
mine the spatial layout of the urban transport network. Before proceeding with our 
analysis, we will first address a number of extensions of the traditional Solow-Vickrey 
model in order to put our work in context in relation to previous analyses. 


EXTENSIONS OF THE SOLOW-VICKREY MODEL 


In this section we examine some of the most relevant examples in the literature 
that attempt to reformulate the structure of the Solow and Vickrey model. The mod- 
els of Mills and de Ferranti (1971) and Solow and Vickrey (1971) are the first at- 
tempts in the literature to study the allocation of urban land between transport and 
other land uses by means of sophistic ated mathematics-calculus of variations, linear 
programming and control the OFV. 

Mills and de Ferranti (1971) observe that congestion costs may determine differ- 
ent location choices and travel patterns in relation to the standard model of Alonso 
(1964). This is due to the distortion of the land rent values which do then not reflect 
the opportunity cost of land. In fact, when deciding on their residential location, 
households do not consider the congestion costs imposed on other residents; they 
take into account only the average travel cost and thereby set in motion a chain of 
events, the most significant being both a greater use of transport and a population in- 
crease within the city. 

The model of Solow and Vickrey (1971) analyses the case of a road network laid out 
in a “long narrow” rectangular city in order to minimize the aggregate transport cost 
incurred in the city per unit time. Solow and Vickrey (1971) consider a rectangular 
urban shape with the business district distributed uniformly in the city and with a 
given total road area. The volume of traffic is generated only by the business activi- 
ties, and the cost per ton yard of moving traffic at a specific point x is assumed to be 
proportional to the de nsity of traffic at x. The transport cost e ncompasses the conges- 
tion cost and the travel cost, i.e., the cost of travel when no other traffic is on the road. 
Two possible solutions arise from the model, that is, the optimal city size solution and 
the constricted city solution. In the former case, “the required business district can 
be accommodated at the least transport cost; congestion and distance costs are just in 
balance. The road of constant width characterizes the optimal amount of land avail- 
able” (Solow and Vickrey, 1971). 

In the constricted city solution, the urban area is constricted due to topographical 
or administrative reasons, and the road area is laid out in a rhomboid shape, that is, 
the road widens steadily from one boundary of the city towards the centre of the city 
where it reaches its maximum width and next narrows symmetrically to a zero width 
at the other boundary of the city. Therefore, the proportion of land allocated to trans- 
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portation is a ae ay and concave function of the distance from the centre of the 
city. By assuming that (1) the urban land is rented competitively, (2) no congestion 
toll is levied on road users, (3) transportation costs are absorbe “dat the trip origin by 
sellers who deliver at a uniform price, and (4) the opportunity cost of land is zero, the 
Solow-Vickrey model demonstrates that the rent function defined in the model “rises 
less sharply than it should toward the city centre because locational advantages are 
not fully reflected in private costs” Solow and Vic ‘krey, 1971). This leads to the con- 
sequent over-construction of roads, especially at the centre of the city. 

Next, the Legey, Ripper and Varaiya (1973) model corroborates the results of the 
Solow and Vickrey model. But they distinguish between the optimal city and the mar- 
ket city in their model. In the optimal city, as in the Solow and Vickrey model, a cen- 
tral authority determines the allocation of land that minimizes social costs. In the 
market city, given the rent profile, the landlords maximize their profits subject to the 
amount of capital devoted to housing, while central authorities maximize their net 
benefits subject to the land and the capital devoted to transport. In the case of the 
market city, the Legey, Ripper, and Varaiya model demonstrates that “the market city 
devotes too many resources to housing at the periphery and too many resources to 
transportation at ‘the centre” (Legey, Ripper, and Varaiya 1973). 

The first extension of the Solow and Vickrey model where a different urban shi ape 
is examined, is the model of Kraus (1974). Kraus (1974) considers the same assump- 
tions as made by Solow and Vickrey (1971), but by introducing a circular shaped city 
he extends the problem from a one-dimensional operation of the transport network 
to a radial and circumferential road system. Given the morphological specification of 
the city, Kraus (1974) mainly examines the problem of the optimal route utilization 
within the city. The study of the optimal allocation of land to transportation is con- 
fined to his Theorem 3, whereby Kraus (1974) states that a necessary condition for 
optimal land allocation between radial and circumferential roads is that the marginal 
rate of substitution of land for variable trip costs in radial travel and circumferential 
travel be equal in the production of radial and circumferential travel. The model 
therefore, shows that the provision of both the inner and outer ring road is optimal. 

A transport constellation similar to the model of Kraus (1974), but with an alterna- 
tive urban shape, is that of a square city developed in a study by Miyao (1978, 1981) 
Miyao (1978, 1981) considers a square shaped city, with a grid of horizontal and ver- 
tical roads. Miyao (1978, 1981), like Kraus (1974) in his model, examines the equilib- 
rium traffic patterns in the city. Every road user selects a cost-minimizing route 
between points of origin and destination which are the boundaries of the smallest 
rectangle, which includes the points of origin and destination at its corners. 

By using the same assumptions as Solow and Vickrey (1971), Miyao (1978, 1981) 
examines the competitive rent profile in relation to the differential transport costs. 
Given that the traffic volumes and densities are identical across all horizontal roads, 
in all points of the vertical axis that crosses at the midpoints of the horizontal roads, 
the marginal benefits are the same if we increase the width of the road. But the pro- 
file of the competitive rent has a maximum at the city centre and minimum at the 
boundary points along the vertical axis. This leads to the conclusion that “the land-use 
decisions based on market rents tend to create too many roads everywhere in the city 
at the optimum and the tende ‘nICy toward excessive road- building is stronger near the 
city bounda ries, especially at the midpoints of the boundary lines, rather than in the 
city centre” (Miyao, 197 S$). If we consider the transport cost of vertical movement as 
negligible, we can derive the Solow and Vickrey one-dimensional model from the 
Miyao model 


The models under scrutiny in our analysis will adopt a standard definition of the 
volume of traffic, but they assume a spec ific definition of density of traffic. Standard 
definitions are available in the traffic engineering literature (see Daganzo, 1997). The 
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Solow and Vickrey model—and the subsequent extensions of it—assume ~~ each 
unit of business area generates g tons of traffic per unit of time, and that y(x) is the 
total business located at the left hand side of point x, whereas A is the total business 
area of the city. Given these assumptions, the total volume of traffic per unit of time 
passing the point x in both directions is v(x) = (2g/A)y(x) (A—y(x)). The density of 
traffic at x is assumed to be a function of the ratio of traffic volume and the width of 
the road at the horizontal coordinate x. However, if we analyse these two definitions 
from a dimensional point of view, we observe that the equations of state of traffic flu- 
idity are not satisfied, which functionally relate density and volume of traffic to the 
average traffic speed. This is because Solow and Vickrey (1971) and many other au- 
thors in the field deal with the term ‘density’ by using a non-standard definition, even 
though this does not necessarily imply invalid results. 

As we have shown, the Solow and Vickre ‘y model has induced various follow-up 
analyses, such as the optimal route choice in a city, the optimal allocation of traffic 
green zones, the effect of congestion on a city, and the analysis of the competitive 
rent profile. From the various transport land-use interaction analy ses, we focus here 
on the main core of the Solow and Vickrey model, namely the optimal allocation of 
road space in a city given by the minimization of the aggregate transport cost. In par- 
ticular, our objective is to develop a generalized analytical formulation to address the 
optimal allocation of urban land for transportation. 


URBAN SHAPES AND TRANSPORT 


Solow and Vickrey (1971) assume in their model a city with a rectangular shape of 
length L and width W. No residential land used is considered. An area A < WL is de- 
voted to the homogeneous distribution of business firms. By assuming w(x) to be the 
width of the road at a horizontal coordinate x, the total business area located at the 
left of x is given by: 


y(x) = fow —w(u))du. 


Given that y(0) = 0 and y(L) = A, we can now express w(x) = W — y’(x). The traffic 
is composed by a single transport mode. Transport cost (i.e., cost per ton yard of mov- 
ing traffic at x) is a function of the density of traffic at x: flv(x)/Av(x)) = = d[v(x)v(x)], 

where d and c are two parameters strictly larger than zero. The transport cost is neg- 
ligible in the breadth-wise direction. By assuming that a unitary business area _ 
duces g tons of traffic, the total right-to-left volume of traffic equals v(x) = (2g/A)y(x) 

(A—y(x)), so that the total transport cost in the entire city is: 


y'(A y)*! 
dj- . (1) 
0 (W-y’)' 


where: 


v(x)/w(x) is the density of traffic at location x; 
w(x) is roadway width; 
c,d are constants with c,d > 0. 


The solution of the optimal allocation of road space in the model is given by the solution 
of the minimization of the total transport cost as defined in (1), subject to y(0) = 0, 
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y(L) = A. By means of calculus of variations, Solow and Vickrey (1971) obtain then 
the area necessary for transport in the case of a rectangular city shape. Based on a 
rectangular restricted city,' they find that land allocated for transport is large near the 
centre and smaller near the city s edges. The remaining area of the city is devoted to 
business purposes. This outcome is obviously driven by the assumption of a rectangu- 
lar form of the city (used for business and transport). In the remainder of this analy- 
sis we address the question of how to analytically express the urban shape when this 
assumption is dropped. Our approach is to take the shape of the business district as 
given and derive the corresponding land demand for transport on the basis of it. The 
urban shape follows as the sum of the two. The next question to be addressed is how, 
by assuming the configuration of the business district, ie. the shape function of the 
business district, we can determine the corresponding shape of the city. 

The key assumptions of the Solow-Vickrey model are also incorporated in our 
analysis. In particular, the urban area is de fined only by both business use and road 
use. The areas allocated for business and transport are, respectively, the total area for 
business and the total area devoted for road infrastructure in the city. The sum of 
these two areas is equal to the total area of the city. 

The width of the city is now an endogenous variable in our approach. Solow and 
Vickrey (1971) assume that the shape of the city is given and that all businesses are 
dispersed within this urban area. Let us assume here, as a contrast to the Solow-Vick- 
rey approach, that the business district, where all business firms are located, has a 
rectangular shape. In order to determine the total urban area and thus the city shape, 
we will first determine the total area allocated for transport. We can then add up this 
area with the given business area. 

We assume that the transport cost incurred by the business area between 0 and x is 
a function of the total volume of traffic passing at x from left to right; thus, we adopt 
this Solow-Vickrey definition of transport cost. Our objective is to define the area in 
the city that needs to be allocated for transport use if we assume a constant ratio of 
volume to capacity of traffic in the CBD. 

In order to identify the city shape, we will first determine the area allocated for 
transport. Because we want to take into account travel flows from right to left and 
travel flows from left to right in the business district, we divide the business district 
into two equal parts along the lengthwavs side. In this case we obtain two equal 
rectangles. For any value of business area at x, y(x), we have gy(x) tons of goods mov- 
ing to various destinations. Following the Solow and Vickrey hypothesis, we assume 
that the trips with destinations to the right of x are proportional to the area to the 
right of x in relation to the business area considered. The volume of traffic at x is next 
given by: 


L 


Jadu 
v(x) = gf adu * 


adu 


0 


where: 


1. We focus our attention on the case of the restricted city, since we believe it is more realistic to con- 
sider a city where its area is restricted by its physical as well as political restrictions on its area. 
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a is the (constant) width of the business district; 
L is the length of the business district: 
g are the tons of goods for the unitary business area. 


The solution of the integral is: v(x) = gax——. Note that this equation is consistent 


with the more general formula for the entire city (which is v(x) 
(2g/A)y(x)(A—y(x))), since in our case we have: 


y(x) = (x/L)A 


and 


La=A. 


The transport cost is a function of the volume of traffic divided by the function of the 
area allocated to transport in the city. If we assume a constant transport cost per unit 
distance throughout the city (denoted as R), we then obtain: 


L 


Jadu 
R=T(x)= gf adu* - * (3) 
w(x) 
fadu 
0 


wl rere: 


T(x) is the transport cost for L = x = 0; 
k is cost per ton yard of moving traffic at x; 
w(x) is the function of the area allocated for transport at location x. 


Given the value of T(x) for all x equal to R, we can evaluate the function w(x). After 
having calculated the functional form of w(x), we can then easily integrate w(x) for 
L = x = Oand then obtain the area allocated for transport for the considered area of 
business district. We must remember that w(x) calculated in (2) will define only half 
of the area allocated for transport, since we have divided the business district area 
into two equal parts. We find that the total area allocated for transport in the city has 
an quadratic shape (see Figure 1). 


Width of the city | 


| Length of the city | 


Fic. 1. Shape of the Urban Area Allocated for Transport 
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Such a shape was expected, because we are considering a rectangular shape of the 
business district, and the central point of the business district is therefore the point 
with the highest volume of traffic. It is easy to demonstrate in this case that the city 
will have a shape given by the sum of a rectangular shi ape and a quadratic shape. 
Thus, with rather simple information on the shape of the business area, we have de- 
duced qualitative information about the urban shape. 

In the next section we examine a particular structure of the business area, i.e., a 
polycentric structure. The structure of cities is often characterized by the combina- 
tion of various centres that cluster in small urban areas in relation to the total metro- 
politan area. We have introduced this particular clustering urban structure as a 
polycentric urban structure, and in the next section we examine how to apply the 
methodology developed here in order to depict the shape of the polycentric city. 


AN EXAMINATION OF POLYCENTRIC URBAN SHAPES 


To analyse a city with a polycentric structure through the use of the methodology 
we have pre viously described, we need to know the shape of the business district. In 
the previous section we have assumed a rectangular shape; in this case, where we 
want to examine a polycentric city, we need to consider a business district having 
more than one CBD. In our first case, the polycentric business district is character- 
ized by a number of business clusters of equal size. Since we need to consider the 
lengthways section of the business district as we have done in the preceding section, 
a valuable approximation of this half of the business district shape is represented by 
the trigonometric function of the sine. 

Our consequent decision to approximate the one half of the business district shape 
by means of the sine function is based on two primary reasons. From an analytical 
point of view we obtain an elegant simplification of the business shape. Secondly, we 
need to remember that through the use of Fourier theory we are able to re ‘duce any 
arbitrary function into an infinite series of sines and cosines; this choice may there- 
fore be regarded as a general case when we consider more complex functions of the 
business district shape. 

We assume that the relevant half of the business district shape of the city can be 
described by the following function: 


F + sin(x L2=x20 (4) 


The value of F must be strictly greater than 1 in order to have a minimum urban area 
between the CBDs (See Figure 2).” 

If F = 1, we obtain separated urban areas and are therefore unable to examine a 
polycentric city. However, in this case we can study a stylised “edge city” structure. 
Given the shape of the business district, we can easily define the volume of traffic 
at X as: 


flr + sin(u)\du 
v(x) = + sin(u)|\du* = (5) 
on 


0 [F + sin(u)|du 


2. In order to achieve more clarity in the exposition, the figures in this section have the same dimension 


but not the same scale 
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| Width of business district 


Length of the city | 


4 Shape of the Business District 


Case 


We are assuming that the city is composed of three CBDs located at 1/2, 5/2 m and 9/2 
t. In this case (as in the previous), we consider the formulation of the volume of traf- 
fic according to the definition given by Solow and Vickrey (1971). 

We calculate the function of the area allocated for transport in half of the city. We 
are assuming a constant transport cost in the entire city. We then obtain: 


in 

j [| F + sin(u)\du 

Rw(x) = gk + sin(u)\du* 5r=x=0 (6) 
[| F + sin(u)|du 


0 


We assume F = 2 and gk/T = 1. In order to calculate the area allocated for transport 
in the entire city, we must sum rte result in equation (6)—related to half of the city— 
with the symmetric result, which is the total area dedicated to transport in the city. 
The following figure depicts the above calculation process: 

Figure 3 represents the shape of the area allocated for transport. In this case the 
shape of the transport area has lost the periodic pattern, which is the main character- 
istic of the shape of the business district. We also observe the dominant position of 
the central CBD in relation to the other two CBDs. By dominant position we mean 
that the geometrical centre of the city has the highest volume of traffic of the city. The 
total shape of the city, given by the sum of the business district area and the area allo- 
cated for transport, is shown in Figure 4. The shape of the city namely the sum of the 
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Fic. 4. Shape of a Polycentric City with a Periodic Business District Shape 
Case | 


shape of the business district and the shape of the area allocated for transport, high- 
lights the dominant role of the central CBD in contrast to the other two CBDs. 

We have thus far examined a business area comprised of a number of clusters; 
these clusters have the same surface and shape and are therefore equal business dis- 
tricts. Let us now consider another type of business district shape. We examine a 
polycentric urban structure, but in this case we imagine that the different CBDs have 
different surfaces and forms; i.e., we have a large business area followed by subse- 
quent smaller ones. The lengthways half of the business district shape is represented 
as follows: 


+sin(x)] L2=x20 (7) 


where F is the same as in equation (5) and b is a constant that must be greater than 1] 
and that in our case is arbitrarily set equal to 2F. To describe the shape of the busi- 
ness district we use a periodic function, but in this case we introduce a damping ef- 
fect in order to obtain the chosen business district shape (see Figure 5). 
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The volume of traffic is given by: 


v(x) = fe*[F + sin(u)|du* 
e*[F + sin(u)\du 


0 


We are assuming here a constant transport cost. We can now evaluate the function 
w(x) that indicates the area allocated for transport in the one half of the city: 


on 

Rw(x) = [F + sin(u)|du* = 
4 


+ sin(u)|du 
0 


As in the previous case, by assuming that F = 2 and gk/T = 1, we are able to depict 
the shape of the area allocated for transport (see Figure 6). 

The shape of the transport area in this second case results from the dominant role 
of the main CBD: the majority of the area allocated for transport is, in fact, in the sur- 
rounding area of the largest CBD. The implication is that the dominant CBD is the 
main origin and destination of urban trips. Our next step is to calculate the entire area 
and thus to obtain the shape of the city by summing up the area allocated for business 
uses and for transportation. We derive then the following city shape (see Figure 7). 

The shape of this city with a business district shape thus defined may, for example, 
correspond to the shape of port cities, in which the main _ is nearest to the port, 
and the city grows outward from this central point (like, e.g., in Chicago, U.S.A.). 

The method deve loped here appears thus to be a ane ‘tool for analysing the ap- 
proximation of urban shapes due to the effects of traffic flows. Our analysis highlights 
the paramount role of road traffic in the growth and development of urban struc- 
tures, and particularly in the spatial definition of the city. 
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Fic. 7. Shape of a Polycentric City with a Periodic Damped Business District Shape 


Case 2 


CONCLUSION 


The model of Solow and Vickrey (1971) and subsequent extensions have been used 
here in order to address the prob lem of the allocation of urban land to tr: unsport. The 
analysis departs from the Solow-Vickrey foundation to investigate the impact of traf- 
fic Hows on the urban shape definition. Particular attention has been given to two spe- 
cific cases of a polycentric urban structure 

The two cases emerge from the necessity to discover the spatial elements that may 
have generated the city shape. As a first step we have revised the Solow-Vickrey ap- 
proach, by assuming as given, the shape of the business district and then we deter- 
mined the shape and area of the urban land use allocated for transport. The sum of 
the business area and transport area allows us to define the morphology of the city. 
The objective of the model has been to determine the mathematical formulation for 
defining the shape of the city when we assume a polycentric shape of the business dis- 
trict. To ascertain the shape of the business district, we approximated two possible 
business districts. The two different shapes were chosen to investigate how different 
geometrical characteristics influence the urban shape definition. In the first case we 
have considered a polycentric structure with three CBDs, each having the same area 
and thus being of equal importance. When defining the urban shape resulting from 
this business district shape, we noticed how the geometrical relation between the 
CBDs leads to a dominant role of the central CBD. In other words, the central CBD 
is not only an origin and destination of traffic, but it is also a location through which 
all traffic passes 

In the second case we have considered the structure of the business district where 
the CBDs were positional in a dominant hierarchical structure. In this case we have 
seen that the geometrical relationship plays a more subdued role in relation to the hi- 
erarchical relationship among the CBDs; i.e., the CBD with a greater business area 
requires the highest share of land for transport. 

Although based upon a possible approximation of the shape of the business district, 
the methodology discussed here provides a useful analytical tool for evaluating the 
impact of traffic flows, and suggests also a possible means for alleviating detrimental 
impacts of spatial externalities. Above all, we have highlighted in our analysis the in- 
direct crucial role that transport plays in urban morphology formation. 
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Exploring the Relations Between Riverbank Erosion 
and Geomorphological Controls Using Geographically 
Weighted Logistic Regression 


The relations between riverbank erosion and geomorphological variables that are 
thought to control or influence erosion are commonly modelled using regression. For 
a given river, a single regression model might be fitted to data on erosion and its geo- 
morphological controls obtained along the river's length. However, it is likely that the 
influence of some variables may vary with geographical location (i.e., distance up- 
stream). For this reason, the spatially stationary regression model should be replaced 
with a non-stationary equivalent. Geographically weighted regression (GWR) is a 
suitable choice. In this paper, GWR is extended to predict the binary presence or ab- 
sence of erosion via the logistic model. This extended model was applied to data ob- 
tained from historical archives and a spatially intensive field survey of a length of 42 
km of the Afon Dyfi in West Wales. The model parameters and the residual deviance 
of the model varied greatly with distance upstream. The practical implication of the 
result is that different management practices should be implemented at different loca- 
tions along the river. Thus, the approach presented allowed inference of spatially 
varying management practice as a consequence of spatially varying geomorphological 
process 
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INTRODUCTION 


The evolution of river planform is an important area of research in fluvial geomor- 
phology, and is of significance to river managers and owners of riparian land. Bank 
erosion, a key process of planform change, continues to be a focus for process and ap- 
plied rese “arch. The erosion of river banks not only causes change in channel plan- 
form and cross-section but is a key process whereby floodplain sediments are 
remobilised to become a part of the basin sediment yield (Bull 1997). 

The literature associated with channel planform change is now extensive, with re- 
search largely focusing at two scales: (i) mesoscale analysis of channel planform and 
bank retreat rates over periods of up to 200 years | (Gurnell 1997) and (ii) more de- 

tailed site-specific studies of the processes of bank retreat (Hooke 1995; Lawler, 
Thorne, and Hooke 1997; Bull 1997). Relatively few studies attempt to link such dif- 
ferent scales of analysis, and even fewer attempt this at the basin scale. Notable ex- 
ceptions include the work of Lawler (1992) who identified theoretical process 
domains for bank erosion, and Abernathy and Rutherford (1998) who attempted a 
similar analysis including the additional influence of vegetation. However, even these 
studies involve fragmentary information and make no e xplicit attempt to link histori- 
cal channel activity with the full suite of conte mporary controls on the physical 
processes of bank erosion. 

Recent geomorphological investigations into the processes of bank erosion have 
been undertaken through the use of erosion pins and PEEPS (Lawler 1993; Lawler, 
Thorne, and Hooke 1997), investigations into bank stability (Thorne 1982; Simon and 
Hupp 1992; Simon and Darby 1995), and the use of bank reconnaissance methods 
(Thorne, Allen, and Simon 1996). However, these investigations tend to focus on par- 
ticular banks and specific erosion issues, and the detailed data sets required preclude 
investigation of the entire catchment. Further, the multivariate nature of the controls 
and the variability of these in space can prevent identification of the actual processes 
of bank erosion. In particular, geomorphologists are frequently challenged to explore 
relationships that may vary in their nature from place to place (they require spatially 
non-stationary models) and that involve categorical data. 

Bank erosion is a significant problem for many people who derive income from, or 
live on, land next to rivers. Bank erosion management has typically been site-specific 
(tending to treat the problem where it arises) and simplistic (using techniques cen- 
tred on the assumption that the erosion is caused by fluvial scour) (Sear, Newson, 
Brookes 1995; Thorne 1997). Recent application of geomorphological theory to bank 
erosion management has focused on, first, identifying the cause of erosion and, sec- 
ond, developing a more sustainable bank management plan in partnership with engi- 
neers and stakeholders. At the heart of this philosopliy is the ability to identify the 
dominant processes and the controls on these processes operating at the bank scale, 
whilst recognising that the causes may lie at the reach or catchment scale. To date, 
this process has been largely field-b: ised and qualitative, with interpretation of the 
evidence relying on the profe ssional capability of those involved. What is needed is a 
more objective framework for integrating and assessing different types of data across 
a broad range of spatial scales. 

In this paper, new survey and statistical techniques are presented that can be ap- 
plied at the basin scale, and that integrate spatially distributed historical data on the 
timing of river channel planform change with spatially distributed data on the prop- 
erties controlling bank failure. A method for the analysis of spatial variation in the re- 
lations between bank erosion and its controlling factors is presented that incorporates 
non-stationary parameters and categorical data. This method is termed geographi- 
cally weighted logistic regression (GWLR). The objective of this paper is to apply 
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GWLR to model the spatial variation in the relation between erosion (presence or ab- 
sence) and several controlling variables for the Afon Dyfi in West Wales. 


THEORY 


Statistical modelling has been applied extensively to fluvial systems (see Rhoads 
1992). In this section, the basic linear regression model and its extension to (i) spa- 
tially varying parameters (GWR) (Brunsdon, Fotheringham, and Charlton 1996) and 
prediction of a binary response variable (logistic regression) are described. 


Linear Regression 


Multiple linear regression is commonly applie «din ge omorphology and the ste ps in 
modelling and the assumptions involved are well understood. Therefore, only the 
basic linear regression model is given here 


k=l 


predicting the target variable y, at location i as a function of m parameters By and B, 

k=1,2 m—1) and m—1 explanatory variables x4. The €,s are independent nor- 
mally distributed error terms with zero means Usually, the least squares method is 
used to estimate the B,s. This is most readily expressed using matrix notation as 


(4) 


where Bis a single column vector of coefficients, the independent variables fill the 
columns of x, the dependent variable fills the single column of y and t indicates the 
transpose of a matrix (Brunsdon, Fotheringham, and Charlton 1996). To estimate By 
x must contain a first column of Is. 

Equation 3 is implicitly a stationary model of the relations between y and x. A sta- 
tionary model is one in which the parameters of the model do not change with geo- 
graphical location. Thus, it is only appropriate where the relations may reasonably be 
modelled as invariant over space. Where the relations may be expected to change 
with location it is necessary to apply a non-stationary model: one that allows some 
freedom in the parameters of the model as one moves from place to place. Brunsdon, 
Fotheringham, and Charlton (1996) developed such a non-stationary regression 
model and termed it geographically weighted regression (GWR). : A similar model was 
applied in relation to fluvial systems by Rhoads (1991). GWR offers a basis upon 
which to tackle some of the issues mentioned in the introduction and represents a sig- 
nificant pote ntial enhancement over conventional re gression approac thes. 


Geographically Weighted Regression 


GWR extends the traditional regression model of Equation 1 to the non-stationary 
case by allowing the B coefficients to vary with location i, 


Bix TE; 


where B, is the value of the k" parameter at location i. It is worth noting that Equa- 
tion 3 is a special case of Equation 5 where the parameters do not vary across space. 


B= 'x'y = 
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A problem with Equation 5 is that there is only one observation at i for each of y, 

and the m—1 xy. To provide more observations with which to build a regression 

model for each i, Brunsdon, Fotheringham, and Charlton (1996) chose to consider 

the relation between y and x within a region defined as geographically proximate to 
each i, while acknowledging that such an approach involves appr ximation. 

The weighted least squares approach to calibrating regression models provides a 
means by which to vary the influence of observations on the resulting model. Specifi- 
cally, in weighted least squares a we ‘ight w, is applied to each squared difference before 
minimization so that the inaccuracy of some predictions carries more weight than that 
of others. If w is the diagonal matrix of w,s then the estimated coefficients satisfy: 


a 


B = (x'wx)'x'wy (6) 


Since the weight w varies with location j, such that a different calibration exists for 
each j, it is more appropriate to write Equation 6 more explicitly as: 


B= (x'wij)x) 'x'wipy (7) 


Note that location has now been referenced by j since the location at which a model 
is calibrated need not equal the location of available data i. 

Brunsdon, Fotheringham, and Charlton (1996) consider a range of possible 
weighting functions for use in Equation 7. These include a simple step function, 


if 


di <d, 


w, = 0 otherwise (8) 


where d,, is the distance between the location i of any observation and the location j at 
which the model is being calibrated. This weighting function has the advantage that 
the number of data used in the regression model is limited resulting in increased 
speed of computation. However, it has the disadvantage that its discontinuous nature 
can result in artefacts in the spatial variation in the regression coefficients. An alter- 
native is to specify w, as a continuous function of d,. The exponential function is an 
obvious choice, 


where a is a non-linear parameter. As a compromise between the computational sav- 
ing of Equation 8 and the continuity of Equation 9, Brunsdon, Fotheringham, and 
Charlton (1996) suggest using the bisquare function: 


4 = (1-d3/d2) if d,<d, 


iv 


w, = 0 otherwise (10) 


The essential idea of GWR, whatever weighting function is chosen, is to give more 
weight to observations close to the location j at which the regression model is desired 
than further away. In this sense, the method draws strong parallels with geostatistical 
techniques for local prediction (Matheron 1965, 1971; Goovaerts 1997). 


w, =exp| (9) 
a 
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Logistic Regression 


Generalised linear modelling (GLM) (Aitkin et al. 1989; Collett 1991) allows the 
formation of a (multivariate) regression relation between a response variable and ex- 
planatory variable(s). All variables may be either continuous or categorical, or any 
combination of both types. GLM has at least two advantages over traditional regres- 
sion. First, GLM allows one to include variables that are categorical and many, if not 
most, variables in geomorphology are categorical. Second, parameters do not need to 
be normally distributed about their mean and no assumptions are mi ide about their 
error distributions. Thus, most of the limitations of traditional regression are re- 
moved. Examples of this type of modelling in geomorphology include Wang and 
Unwin (1992), Uno, Sugii, and Hayashi. (1994), and Atkinson and Massari (1998). 

In this paper, our interest is the presence or absence of erosion. The most natural 
regression model for this type of binary response (with | = “erosion” and 0 = “no 
erosion”) is the logistic where the usual linear predictor is augmented by a logistic 
link function. The full logistic regression model is thus, 


log =¥, 


~ Pi 
or equivalently 


exply 
1+ exp(y,) 


where p, is the probability of an event occurring associated with a given location i and 

y, is the linear model (equation 3). The linear model formed is then a logistic regression 

of success or failure of a given binary variable (for example, presence or absence of ero- 

sion) on the explanatory variables. Useful introductions to GLM are given in Baker and 

Nelder (1978), Healy (1988), Aitkin et al. (1989), Dobson (1990), and Collett (1991). 
The model deviance D for binary data in a GLM may be expressed as: 


n 
-2> logit(p, ) + log(1 — p, 
i=] 


This model deviance cannot be used directly as an indicator of goodness-of-fit since 
one cannot compare the fitted probabilitie s and the binary observations (Collett 
1991). However, since the difference in deviance for a given difference in degrees of 
freedom between separate models can be approximated to have a Chi-squared distri- 
bution, the reduction in model deviance can be used as an indicator of the signifi- 
cance of each term added to the model (Baker and Nelder 1978). Therefore, terms 
are normally added to the model in a ste pwise fashion, checking the significance of 
each term at each ste p by finding the probability for a given reduction in model de- 
viance and number of degrees of freedom. Only those terms that are significant at a 
given confidence level (for example, 95%) should be retained. Once a term has been 
checked as significant, each parameter estimate should be tested for significance 
against its standard error using the t-value (estimate/s.e.). At the 95% significance 
level a parameter estimate is significant when it is greater than approximately twice 
the standard error (Atkinson and Massari 1998). 
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A potential problem exists for categorical variables that are divided into binary 
terms because only certain of these terms may be significant while others may be in- 
significant. It is common practice to aggregate such categorical variables into fewer, 
coarser classes to reduce the problem and also to simplify the resulting GLM. Inter- 
preting the resulting GLM is quite straightforward. For categorical variables, a posi- 
tive value implies an above average correlation and a negative value implies a below 
average correlation. For continuous variables, a positive value implies a direct corre- 
lation and a negative value implies an inverse correlation, while the magnitude of the 
value gives the strength of the correlation. 

In this paper, GWR and GLM (specifically the logistic model) are combined to 
form a geographically weighted logistic regression (GWLR). This combination of a 
spatial model with logistic regression has similarities with, but is conceptually differ- 
ent to, autologistic regression (Augustin, Mugglestone, and Buckland 1996). The 
GWLR will be used to explore the dependence on geographic location of the relation 
between erosion and geomorphological controls for the Afon Dyfi, Wales. 


FIELD SITE 


AND SURVEY DATA 


The Afon Dyfi 


The Afon Dyfi is a gravel-bed river situated in North Wales, running partly through 
Snowdonia National Park. The river drains from the Aran Mountains and enters the 
sea through an estuary of international importance (designated as a Ramsar site and a 
Biosphere Reserve The river and its catchment have a glacial legacy and are com- 
posed of two parts. The upper reaches flow through bedrock, and there is very little 
floodplain due to glacial terraces and valley walls. The middle and lower re vaches run 
through alluvial and glacial deposits (Jones 1995). Here, wide floodplains and grazing 
land take over from bedrock gorges and wooded riverbanks, although several ter- 
races, both glacial and fluvial, are still present in the middle reaches that confine 
channel movement. 

Riparian landowners have identified several bank erosion issues within the Afon 
Dyfi that relate primarily to loss of valley-bottom farmland. In response to a series of 
requests for bank prote ction, the local river management age ncy initiated a study to 
identify a more rational, catchment-scale management strategy that accounted for 
the processes controlling bank erosion on the Afon Dyfi. To determine the controls 
and influences on the processes of erosion, large-area (both temporal, or historical, 
and spatial) investigations were undertaken (German 2000). The “historical” investi- 
gation provided data on the long-term stability of the planform and the “spatial” in- 
vestigation identified and measured properties that defined the fluvial environment 
at the time of the investigation. All the data sets were contiguous (spatially) for left 
and right bank, from the mouth of the river to its source. 


Historical Data 


A historical stability index was constructed by digitising historical maps and aerial 
photographs from 1890 to 1992. These planforms were imported into the SPANS 
geographical information system (GIS) and annotated with the age of the adjacent 
floodplain and the number of times the channel had remained in its present position. 

The age of the floodplain adjacent to the present bank line (1998) provided informa- 
tion on the age of the present bank material. The banks were graded according to the 
age of the most recent channel as shown in Table 1. The number of times the channel 
had been present in the same position was determined by GIS overlay and was between 
one and five times. From this information a historical stability index was created and 
each individual bank was assigned a number (between 1 and 50), as follows: 
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TABLE | 
The Allocation of Scores to Bank Lines of Different Dates 


Date No 
Betore 1890 10 
1890 
1902 6 
1945 4 
1980 2 


Historical stability = age of the adjacent floodplain 
< number of times channel has been 
in the same position (14) 


The larger the number, the more historically stable was the bank. In addition, the 
width of the planform movement (swath width) was measured every 100 m to deter- 
mine the magnitude of the channel change. 


Field Survey 


Field data were acquired during the 1998 field season using a continuous walk- 
through survey. Data were acquired on several variables including the location of cur- 
rent erosion, vegetation, and bank material. Both banks were surveyed from mouth to 
source. In the field, banks were classified according to the active erosion process act- 
ing on the bank (for example, overhang, cantilever, or slab failure). This information 
was annotated on a base map and subsequently attributed to digitised vectors within 
a GIS package. The same process of data acquisition was used for vegetation and 
bank material. To determine the relationship between the factors controlling bank 
erosion and the physical locations of this erosion, it was considered appropriate to re- 
classify the detailed erosion types into the presence or absence of erosion. Vegetation 
data included type, location on the bank, density, and age. All of these properties pro- 
vided information related to the stability of the bank on which the vegetation grows. 
These data were used to create a vegetation stability index, and individual banks were 
allocated a number (between 1 and 97), as follows: 

Vegetation stability = {(woody age X 3) 
+ (woody density X 3) 
+ (woody location X 3)} 
+ {(herbaceous age X 2) 
+ (herbaceous density X 5) 
+ (herbaceous location X 3)} 
+ {(grass density X 4) 
+ (grass location X 3)} (15) 


Equation 2 defines the weights given to different types of vegetation. The larger the 
number, the greater the contribution to bank stability. Woody vegetation has the 
same weight for each variable (age, density, and location), whereas for herbaceous 
vegetation the weights differ reflecting the view that age is a less important influence 
on bank stability than density (Thorne 1990). Age has two categories (young and ma- 
ture): the older the vegetation the more stable the bank. Density has three categories 
(sparse, average, and dense): the denser the vegetation the more stable the bank. Lo- 
cation has three categories (top, middle, and bottom of the bank in any combination). 
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Each of the three locations has the same weighting: the more locations that the vege- 
tation occupies the larger the score. Table 2 describes the scores assigned to each 
variable in Equation 2. 

Bank material was classified into five categories by field observation. These were 

) cohesive, (ii) composite (30% gravel), (iii) composite (60% gravel), (iv) non-cohe- 
sive (100% gravel), and (v) bedrock. These data were allocated to every bank within 
the river. 

Other variables used in this research were stream power (Bagnold 1966) and the 
location of meander impact points on the bank lines (Thorne 1997). Direct measure- 
ment of stream power was not possible for the whole river due to the absence of sur- 
vey data. Instead, an approach was adopted whereby the cross-sectional area every 
100 m down river was predicted from the bank height and map-derived channel 
widths. Channel gradient was predicted from a 1:25000 map and bankfull discharge 
was calculated using the regime equations of Wharton (1995). Controls on these pre- 
dictions were available at five points along the channel where direct survey had been 
made, or where a gauging station existed. Typically, the values obtained under-pre- 
dicted the locally derived values of stream power by 20%. 

Locations of meander impact points were assigned to the bankline within the GIS as 
0 = no meander impact point and 1 = meander impact point. Each meander impact 
point was spatially distributed, that is, represented a sequence of points along the river. 


Data Organisation 


Data associated with each bank were sorted by location. The sequence of locations 
was used to determine the mid-point of each individual length for each bank. The his- 
tograms for each of the five variables are shown in Figure | (left bank) and Figure 2 
(right bank). Each variable is also plotted against position upstream in Figure 3 (left 
bank) and Figure 4 (right bank). 

The data were entered into a GIS to allow visualisation of the individual spatial 
variables. However, the number of variables available and the length of the river 
(over 42 km) made it difficult to view the different relations between control and ef- 
fect. In addition, for management purposes, it is important to determine any differ- 
ences in these controls at a local level, as this information can provide suitable 
guidance on the type of bank protection that may need to be adopted locally. There- 
fore, a methodology was required that could de al with a large data set and quantify 
the relations between contemporary erosion and several variables at the local scale. 


METHODOLOGICAL CHOICES 


This section describes several of the choices made in applying GWLR to model the 
spatial variation in the relation between erosion (presence or absence) and geomor- 
phological variables for the Afon Dyfi in West Wales. 


TABLE 2 


Scoring System Used to Construct the Vegetation Index 


4 b 
woody vegetahon herbaceous ve vetation grass 
Score Werght Score Weight Score Weight 


: Age 3 12 2 
Density 125 3 125 5 125 4 
Location 123 3 123 3 123 3 


Norte: A score is assigned per variable (e.g.. voung age = score of 1) and all weighted scores are then summed to construct the vegetation 
tides 
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Fic. 2. Histograms of Vegetation, History, Material, Meander and Power for the Right Bank. The vari- 
ables are defined in the text. 
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Fic. 3. Erosion, Vegetation, History, Material, Meander and Power Plotted Against Distance Upstream 
for the Left Bank 
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Fic. 4. Erosion, Vegetation, History, Material, Meander and Power Plotted Against Distance Upstream 
for the Right Bank. 
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Univariate Modelling 


In this paper, the multivariate GWLR was not fitted. Rather, univariate GWLR 
models were fitted to each explanatory variable in turn. The basic rationale for this 
choice was that the main obje ‘ctive of the paper was not pre «diction (using the geo- 
morphological data to determine with greatest precision the level of erosion likely), 
but rather to explore the relations between erosion and several individual geomor- 
phological variables. Multivariate regression is suc ‘h that meaningful relations with 
certain explanatory variables can become downweighted and potentially omitted 
from the multivariate model as other correlated variables enter the model first. To 
avoid this situation, variables must, at least initially, be treated individually. Addi- 
tional benefits of univariate modelling are that the procedure is kept simple, the 
models are easier to interpret, and there are no interaction terms. However, one 
should remember that univariate models do not involve independent effects (i.e., 
controlling for the effects of other explanatory variables). Thus, the fitted parame- 
ters in a set of univariate models may be very different to those in the equivalent 
multivariate model 


Choice of Weighting Function 


The exponential weighting function (Equation 10) with a = 1 km was chosen for 
this research because the experience of the authors in geostatistics suggests that for 
physical properties in geomorphology and related subjects the exponential model 
often describes well the autocorrelation structure. In geostatistics, and specifically 
the linear prediction technique known as (co)kriging, interpolation weights are 
chosen by reference to a model fitted to the experimental autocorrelation function 
or similar) and the exponential model often provides a good fit to the experimental 
plot. Thus, the exponential model represents a reasonable a priori choice. The 
Gaussian model (one of several models used by Fotheringham, Brunsdon, and 
Charlton 2000) is, by comparison, rarely used in geostatistics and describes only the 
autocorrelation structure of variables that are loc ally very smooth (for ex unple, dig- 
ital elevation data) 


Choice of Number of Data 


Initially the exponential model was used to weight all 830 (or 758) data along the 
left (or right) banks of the river. However, the computational burden when running 
the model-fitting procedure in Splus was prohibitive, each variable for each bank 
taking around one hour to run. Therefore, the number of data used in the regres- 
sion modelling was limited to the nearest 101 points, that is, 50 points either side of 
the central point i. This number was chosen as a compromise between having many 
data for statistical regression and few data distant from i. The computational time 
required for model fitting was reduced to around one-fifth of that required when 
using all data 

A disadvantage of using a limited number of data was that when combined with 
the exponential model the weights do not decrease to zero at the most distant point 
as for the bisquare function), but rather decrease to a minimum. This may lead to 
artefacts in the maps of regression coefficients as described earlier. However, we 
chose this strategy because (i) the exponential model is commonly fitted to auto- 
correlation functions in physical geography and (ii) we wanted to fix the number of 
data used in modelling at each location i, (except for the ends of the river course 
where the number decreased). The bisquare function would involve different num- 
bers of data being used at each location i. A comparison between the results using 
all data and the results using only 101 data was made and the differences were only 
slight. 
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ANALYSIS 


The GWLR model was applied to each of the five explanatory variables, and for 
each of the left and right banks. This generated two sets of results. The first is the 
usual slope coefficient plotted against position. The second is a plot of {(null de- 
viance-residual deviance )/null deviance}, here called a deviance ratio. This was pro- 
vided to give some measure of the goodness-of-fit of the model at each location. The 
results for each of the explanatory variables are described below. 


Vegetation 


The model coefficients for vegetation are plotted against position in Figures 5a 
(left bank) and 6a (right bank), while the deviance ratio is plotted against position in 
Figures 7 (left bank) and 8 (right bank). There are several interesting features to 
note. Overall, there is a negative relation between vegetation and erosion. That is, 
the greater the index of vegetation the less erosion; the implication is that vegeta- 
tion makes banks more stable (or, in principle, that more vegetation grows on banks 
that are stable) (Thorne 1990: Millar 2000). However, the nature of the relation be- 
tween vegetation and erosion varies along the river channel. In particular, vegeta- 
tion appears to have a greater influence on erosion for the downstream reaches 
(less than 20 km upstream on the plots). The influence seems to peak at around 10 
km and 15-18 km upstream for both the left and right banks. This again is what 
might be expected since the downstream sections of the river run through alluvial 
deposits where the presence of additional protection from fluvial scour or in- 
creased stability of the bank materials would be more pronounced. It is worth not- 
ing that around 13 km upstream there is no relation between vegetation and 
erosion (for both banks) suggesting that either there is no erosion and no vegetation 
as bank protection is present in this area or the controlling variables do not explain 
erosion in this reach. 

There is a large peak in the influence of vegetation on erosion at around 30 km up- 
stream, especially for the right bank. For the upper reaches of the river, the channel 
is bedrock, but large glacial terraces overlain on the bedrock adjacent to the channel 
provide sufficient material for vegetation to influence erosion. 

Not only does the strength of the relation vary with position, but also there is a di- 
rect relation between the two variables around 20 km upstream. At this point, a high 
vegetation index appears to correspond to large amounts of erosion. This unexpected 
result appears at a position where erosion is dominated by scour of the upper banks 
arising from the turbulence developed around widely spaced trees. 


History 


The most notable historical feature is the small correlation with erosion (Figures 7 
and 8). Generally, one might expect banks that are eroding now to have been eroding 
in the past. However, conte mporary erosion does not necessarily contribute greatly to 
historical lateral change in river planform. Thus, banks that are eroding at present are 
not necessarily related to planform change over the last 108 years. There are two 
main sets of positions at which the relation between history and erosion is inverse as 
expected: one at 10 km and 38 km upstream (left bank) (Figure 5) and one at about 
23 km upstream (right bank) (Figure 6). At 10 km and 38 km (left bank) we expect 
positive erosion and a low historical index value (implying less stability). At 23 km 
(right bank) we expect zero erosion and a high historical index value (implying greater 
ste ibility). ). At around 38 km upstream (right bank) there appears to be an exception to 
the generally inverse relation between history and erosion. We currently have no ex- 
planation for this exception, but it is worth noting that the magnitude of this effect is 
very small (Figure 8). 
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Material 


The deviance ratio for material (Figure 7, left bank and Figure 8, right bank) shows 
that, overall, variation in material has the largest effect on erosion along the Afon 
Dyfi. However, as for vegetation, the influence is not constant along the river. The 
largest peak occurs at around 30 km, with a second peak at around 20 km (both left 
and right banks). These positions correspond to changes from composite banks with 
30% gravel and 60% gravel to bedrock (30 km) and from composite banks with 30% 
gravel to composite banks with 60% gravel (20 km). There is also a notable peak for 
the left bank at around 10 km (Figure 7c). This unexpected relation is most likely to 
be the result of bank protection centred on this location. 

The coefficients of the fitted model are shown in Figures 5b (left bank) and 6b 
(right bank). The plots for composite banks with 30% gravel and composite banks 
with 60% gravel are similar. These two binary variables are competing at positions 
less than 30 km upstream and, therefore, convey similar information. Bedrock is im- 
portant upstream of 30 km as might be expected. Cohesive banks have some influ- 
ence between 20 km and 30 km upstream while banks with 100% gravel, which rarely 
occurs, has limited influence. 


Meander 


Figures 5a, 7 (left bank), 6a, and 8 (right bank) demonstrate that, as for history, 
whether the channel is meandering or not appears to have limited influence on bank 
erosion. Examining the deviance ratios closely reveals that this small influence is 
greatest between 10 km and 30 km upstream. The coefficients for meandering relate 
closely to the deviance ratios. In particular, two areas with large negative coefficients 
occur at around 10 km (and 16 km, right bank) and 24 km (and 28 km, right bank) up- 
stream. As for material (above), the relation observed at around 10 km is likely to be 
the result of bank protection. The relation observed at 24-28 km may be explained by 
the controlling influence of bank material, in this case bedrock that outcrops where 
the meander cuts into the valley side. Overall, the Afon Dyfi has relatively few mean- 
dering sections, and in many cases these have been the focus for bank protection 
measures. In the headwater reaches, meanders are often confined by cohesive glacial 
drift or bedrock. Together these reduce the expected magnitude of correlation be- 
tween meandering and the presence of bank erosion. 


Stream Power 


The most notable feature of the plots of the coefficients and deviance ratios of the 
GWLR for stream power against position upstream (Figure 5a, 6a, 7, and 8) is that 
the influence is greatest for the upper reaches (around 23-27 km to 34 km upstream). 
However, interestingly the relation is inverse for these positions, which was not antic- 
ipated. This can be explained in terms of the large values of unit stream power for the 
upper reaches. Where stream power is very high, there is little erodible material pre- 
sent. Where stream power is lower, the material present includes glacial terraces that 
are more likely to be eroded than the bedrock. In the lower reaches, an inverse rela- 
tion is present for several positions. These relations arise in sections of the river 
where gravel bars result in both relatively low stream power (since channel width is 
maximised) and bank erosion. It is necessary to remember that the values of bankfull 
stream power are all above the critical threshold for the median bed material. Also, 
bank protection, which has not been included in the model data, results in reaches of 
high stream power (hence the need for bank protection) and low erosion. 

There are several positions where a direct relation between stream power and ero- 
sion results, the most important of which appears at 15 km upstream. For these loca- 
tions, weaker alluvial material supports such a direct relation. 
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Fic. 5a. The Regression Coefficients for Vegetation, History, Meander, and Power Plotted Against Dis- 
tance Upstream for the Left Bank. 
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Fic. 5b. The Regression Coefficients for Material Plotted Against Distance Upstream for the Left 
Bank. Material is divided into five binary components as described in the text 
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Fic. 6a. The Regression Coefficients for Vegetation, History, Meander, and Power Plotted Against Dis- 
tance Upstream for the Right Bank. 
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Fic. 6b. The Regression Coefficients for Material Plotted Against Distance Upstream for the Right 
Bank. Material is divided into five binary components as described in the text 
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DISCUSSION 


GWLR in Physical Geography 


GWR was developed primarily for application in human geography, for example, to 
socioeconomic data such as provided by the UK census. In human geography, it is 
reasonable to expect geographical variation in relations. For example, house prices in 
the urban area of Southampton might depend on an entirely different set of variables 
to house prices in its rural surroundings. One example would be the relation between 
house price and proximity to playing fields. In the urban area this might increase 
house price, whereas in the rural area it might decrease it. The most obvious measur- 
able quantity that would describe such a difference is geographical location. 

In physical geography, it is generally assumed that variation in some property over 
space cannot be due to space itself, but rather is due to some control that varies over 
space. The positivist view commonly held by physical geographers suggests that ob- 
servations can be generalised into laws (Fotheringham, Brunsdon, and Charlton 
2000). Thus, the goal of much research in physical geography is to find the underlying 
causal factors responsible for variation in some property (or in the present case, its re- 
lation with other properties) over space. In terms of regression modelling, it should 
be possible to build a single model that describes the entire set of possible circum- 
stances that may arise. However, a general model describing such a large set of possi- 
ble circumstances would include a large number of variables, possibly non-linear 
relations and a large number of interaction terms. Such a model would be cumber- 
some and difficult to interpret. 

An advantage of GWR (or GWLR) is that by fitting the model locally, certain vari- 
ables may be redundant in the local model because they are effectively held constant 
over space. Such terms have low-frequency or long-range variation. For example, in 
the case of the Afon Dyfi it is possible that elevation or some second-order derivative 
of elevation has the greatest influence on erosion. Since this property has not been 
measured, the global regression model would fail to describe well the relation be- 
tween erosion and other geomorphological variables. However, within a region for 
which elevation varies little, the local model may explain most of the local variation in 
erosion. Thus, in an applied context, there is merit to suspending the search for 
causal factors, instead producing predictive models that have value locally and reduce 
the need to acquire large numbers of data in the field. 


GWLR in Geomorphology 


The attraction of GWLR analysis to geomorphology lies in the ability to explore 
spatial variation in the relations between driver variables and process—in this case, 
bank erosion. Such analysis permits the identification of zones of process control, 
which might be expected to vary between reaches of a river, but also between catch- 
ments. Using the Afon Dyfi as a case study, it becomes possible to comment on the 
widely held theoretical view that a specific sequence of process domains exist along a 
watercourse (Lawler 1992; Abernathy and Rutherford 1998). The GWLR analysis of 
the Dyfi data set demonstrates that the scale and sequencing of process domains may 
be much finer and more varied than existing models suggest. Rather, controls on bank 
erosion processes vary at a fine spatial frequency, primarily according to the available 
energy to drive erosion (stream power) and the resistance to that erosion provided by 
bank | materials and vegetation. For the Dyfi at least, channel planform and the histor- 
ical activity of the channel do not appear to influence the process or location of con- 
temporary bank erosion directly, although both may control the location and nature 
of bank materials and vege tation. 


. 
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GWLR for Management 


The results from GWLR provide an insight into the local relations between erosion 
and the controls on that erosion. This information can be used, as part of a manage- 
ment strategy, to focus decisions regarding bank protection options. In a particular 
reach, the fact that, say, vegetation has an influence over erosion locally can be used 
to define the most suitable management recommendations for that reach. This allows 
managers to work with the natural fluvial system on a local basis. By providing under- 
standing of the physical processes that create areas of erosion locally and of how these 
relations can change within a small area of the river, GWLR may allow river manage- 
ment to be more targeted and effective. 

The remainder of this discussion is concerned with several issues which must be 
addressed when implementing GWLR. 


Other Issues 


Although some important advances of understanding and management application 
have been achieved, it is important to exercise caution when interpreting the results 
of GWLR, and this is particularly highlighted by the example of bank material. Since 
the categories of material exhibit relatively low fre “quency spatial variation, the suc- 
cessive peaks and troughs in the influence of material on erosion along the Dyfi may 
reflect local variation (large influence) and lack of local variation (small influence) be- 
tween the classes of material. Thus, the deviance ratio will tend to be small where 
there is little or no variation in the category of material locally, and has the potential 
to be large only where variation in material exists locally. 

An issue that is closely related to that above is significance. GLM is normally an ex- 
ploratory technique: models are developed interactively, which provides an experi- 
enced user with a real “feel” for the data. This interactive procedure for fitting a 
logistic model sits awkwardly alongside GWR for which it is a requirement that fitting 
is automatic. In GWR, models must be calibrated for each desired location j of which 
there are likely to be between one hundred and one million, precluding interactive 
fitting. A consequence of automatic fitting is that some model coefficients are likely to 
be insignificant for certain locations, and for some locations it is possible that the en- 
tire model may not be significant. The issue of lack of significance is discussed by 
Brunsdon, Fotheringham, and Charlton (1996), but is effectively avoided when they 
map the entire set of m coefficients from GWR. 

A further problem with GWLR that should be apparent from Equations 8 to 11 is 
how to choose an appropriate non-linear parameter a for the weighting function. In 
this paper, a choice was made following experimentation. However, an alternative is 
the cross-validation approach suggested by Cleveland (1979), in which the values at 
each i are omitted when calibrating the regression model at i: 


Y ly; (16) 


where y',,(a) is the fitted value of y, with the observation for location i omitted from 
the regression calibration. The optimal value of a can be chosen as the one that min- 
imises the cross-validation score. An important point, however, is that where the ob- 
jective is to explore relations (rather than to predict), variation in the regression 
model with a can be informative. 
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CONCLUSION 


The major attraction of the GWLR approach lies in its ability to permit objective 
quantification of spatial variation in the magnitude and direction of the relations be- 
tween some process-related response variable (bank erosion in this example) and its 
controlling variables. In the present example, for the Afon Dyfi, Wales, the model pa- 
rameters and the residual deviance of the model varied greatly with distance up- 
stream. The practical implication of this result is that different management practices 
should be implemented at different locations along the river. Thus, the approach pre- 
sented allowed inference of spatially varying management practice as a consequence 
of spatially varying geomorphological process. 

Future research will focus on extending the model to (i) the prediction of cate- 
gories and magnitudes of erosion (c.f., a binary response), (ii) multivariate modelling, 
and (iii) new application areas in geomorphology. 
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Measuring Accessibility for People with a Disability 


This paper discusses some of the inherent problems associated with measuring acces- 
sibility for people on a landscape of surfaces, barriers, and travel modes. Along with 
this discussion we propose a new perspective for measuring accessibility with a focus 
on people with differing abilities. Even though our focus is on people with a physical 
disability, such an approach can be easily extended and is able to be generalized to 
other needs and differences. Traditional measurements of accessibility are flawed, as 
they fail to directly account for mobility and physical differences among people. They 
ignore structural barriers and individual mobility limitations that affect travel time, 
effort, and even successful completion. To make sense of this dilemma, we propose an 
accessibility measurement framework that includes measures of absolute access, gross 
access, closest assignment access, single and multiple activity access, probabilistic 
choice access, and relative access. Most of these measures of access have been pro- 
posed by others, but our framework attempts to codify an approach that helps to 
overcome weaknesses in using only the absolute access measurement currently used in 
ADA compliance. Such measures can be used to map accessibility as well as to help se- 
lect the mitigation or renovation projects that yield the greatest increase in accessibil- 
ity for people with disabilities. We argue that for many urban and building design 
problems providing absolute access for people with physical disabilities should be ac- 
companied by the use of a relative access measurement, so that removing barriers can 
be done in the order that provides the greatest improvement in access for a given level 
of expenditure. 


1. INTRODUCTION 


Accessibility is an important characteristic of the geography of space, whether it in- 
volves a small area (e.g., elements within a building) or a large region (e.g., elements 
within a metropolitan area). It is frequently included as a goal in transportation plan- 
ning, land use planning, and building design. The reason for this is that most would 
argue that accessibility is something to value and improve in an urban setting. Beyond 
acknowledging the importance of accessibility as a goal in planning, it has seldom 
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been translated into performance measures by which policies are evaluated (Handy 
and Niemeier 1997). Without an accepted measurement approach for accessibility in 
a planning problem, it is difficult if not impossible to compare alternatives in a ratio- 
nal manner with re spect to changes in accessibility. Perhaps Gould (1969) stated it 
best when he said: “accessibility is.... a slippery notion.... One of those common 
terms that everyone uses until faced with the proble m of de fining and measuring it.” 
There have been some notable exce ptions to the paucity of using a measurable acces- 
sibility criterion in public policy. One of these is the definition of acce ssibility for peo- 
ple with disabilities in their approach to a building. The American with Disabilities 
Act (ADA) states in section 4.3.2 under the title of “Accessibility Guidelines for 
Buildings and Facilities” the following: 


At least one accessible route within the boundary of the site shall be provided from 
public transportation stops, accessible parking, and accessible passenger loading zones, 
and public streets or sidewalks to the accessible building entrance they serve. The ac- 
cessible route shall, to the maximum extent feasible, coincide with the route for the gen- 
eral public 


Measuring accessibility for ADA compliance, is then translated into a simple test by 
the policy: at least one accessible route to a building entrance must be provided from 
the street, from passenger loading zones and when appropriate from public transit 
stops and public parking. It is easy to test compliance for such a policy in that either 
appropriate access is provided or it is not. Thus, the public policy on access for peo- 
ple with disabilities boils down to: it is either provided within an absolute sense or it 
is not. This means that accessibility requirements within the context of the ADA rules 
are standards-based. It is important to recognize that the Code of Federal Regula- 
tions (28 CFR Part 36) dealing with the standards for accessible design describe how 
accessibility to specific elements (e.g., telephone, drinking fountain, or toilet facility) 
should be accomplished as well as the provision of accessible routes to such elements 
and building spaces. Not all elements need to be accessible, just a minimum required 
number. Thus, the measurement of accessibility is defined to the extent that it is easy 
to measure compliance. Although each element (e.g., toilet) is important, our objec- 
tive here is to discuss the general concept of accessibility with respect to people with 
disabilities. 

It should be acknowledged that the above policy and implied measurement ap- 
proach (i.e., standards-based) has led to the improvement of building access in every 
city and town in the U.S. for people with physical impairments. Although this stan- 
dards-based approach has been valuable, it lacks the sensitivity that other measures 
of accessibility might provide. Since the standard is to ensure that absolute access has 
been provided, little attention has been devoted to the value or quality of the access 
provided. Further, providing a second access route to a building is not given any 
value, by virtue that a first access route meets the standard. This paper focuses on the 
weakness of this standards-based approac +h without some attention being given to the 

value of relative access. Our objective is to present a paradigm for measuring access 
for those with physical or mobility impairments within an urban landscape, that ex- 
tends beyond the standards- based approach. This proposed approach is sensitive to 
both the number of routes and the values of each access route provided. It is hoped 
that this proposed paradigm will help pave the way for future accessibility research as 
well as form the basis for which the current standards-based approach is enhanced in 
urban planning and architectural design. In the next section we present an overview 
of the literature on accessibility measures. In the subsequent section we begin the 
process of describing how such measures can be extended to people with a physical 
disability and enhance the standards-based approach of absolute access that is dic- 
tated by the ADA guidelines. Then, we propose several new measures of relative ac- 
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cess that we believe meet the spirit of the current law and provide guidance for im- 
proving urban design for people with disabilities. We demonstrate these concepts 
with several simple access issues faced by individuals on the University of California 
at Santa Barbara campus. We conclude with recommendations for future work. 


2. MEASURING ACCESS, A REVIEW 


Accessibility is determined by the spatial distribution of potential destinations, the 
ease of reaching each destination, and the magnitude, quality, and character of the 
activities found there (Handy and Niemeier 1997). The greater the number of poten- 
tial destinations within some defined time or distance range, the greater the accessi- 
bility. The closer such choice destinations are within this maximum range, the higher 
the level of accessibility. An accessibility measure estimates the level of access to 
some type of activity from a starting location or home location to one or multiple lo- 
cations of that activity given a travel mode, distance, time, and cost constraints. 
Weibull (1976) has presented an axiomatic framework for the development of acces- 
sibility measures. Conceptually, there are seven main types of measures for accessi- 
amy that have been the subject of a number of papers in the literature: 1) counting; 

2) total sums of distances; 3) closest available; 4) gross interaction potential; 5) prob- 
abilistic choice, 6) net and maximum benefit; and (7) absolute. Albeit, there are other 
proposed accessibility measures, the above classes capture many of the elements of 
measuring access. 

The simplest approach involves a count of the number of locations at which an ac- 
tivity can be found within some maximum distance, time or cost of travel from a given 
location or point i (Wachs and Kumagai 1973). Talen and Anselin (1998) call this a 

“container” measure. This means that the accessibility of activity k for a person at lo- 
cation i using travel type / can be calculated as: 


On (1) 


jeM yy 


where: 


the accessibility of person i or zone i with regard to activity k and travel type 


l 

Ox = the number of opportunities for activity k at location j. 

dj = the distance, travel time, or other measure of effort separating i and j for 
person of travel type 1. 

sy = the maximum distance or range over which an activity is considered acces- 
sible for travel type |. One can also use travel time or cost as a criterion as 
well 


My = {j | diy < sy}, the set of activity locations considered accessible. 


Equation (1) represents a simple count of the total number of places where activity k 
is considered accessible in relation to location i and travel type /, within some maxi- 
mum travel distance, time, or cost. The larger this value the greater number of op- 
portunities are within reach of location i for activity k and the better the accessibility. 
We have chosen to differentiate possible differences in travel mode as well as individ- 
ual by the index /, e.g., an older women who owns a car or a young adult who uses a 
wheelchair (see Hagerstrand (1970) for an introduction to time, space, and mobility 
constraints in travel; Miller (1991) and Kwan (1998) for a framework on measuring 
individual accessibility within a space-time framework; Hanson (1995), Hanson and 
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Schwab (1987), and Kwan (1999) for a discussion on travel and accessibility differ- 
ences among people; and Marston, Golledge, and Costanzo (1997) who investigated 
the travel behavior for those who are vision impaired and rely on public transit). Pirie 
(1979) stated that zonal accessibility measures not only neglect the distribution of ac- 
tivities within a zone, but also assume that all individuals within a zone face the same 
set of opportunities. We use the index i to represent a location about which accessi- 
bility is to be measured, not a zone. If the index i is used to represent a zone, then the 
issues raised by Pirie are relevant. 

The first measure of accessibility captures distance only in terms of some pre- 
specified maximum distance, time, or cost of travel. It does not distinguish among the 
accessible sites of the activity to location i, in terms of distance. Ingram (1971) pro- 
posed a measure where accessibility at a given location i is defined as the sum of the 
spatial separations from all other locations j to that location. We can represent this 
measure as: 


Ay = 3d, 0% (2) 


j 


In general, the larger the value, the less accessible are the activities for a given i. A 
central location among a distribution of points is more likely to have a lower accessi- 
bility value, and hence be more accessible than other points. If one imposed a maxi- 
mum distance range for accessible sites (like in equation 1), then it would be difficult 
to compare values across the landscape, because a sum may be low (and hence, sug- 
gest great accessibility), when the sum only included 1 accessible site within the 
range as compared to the presence of many more available sites. Rather than sum 
over all possible activity locations, regardless of distance it makes sense to sum up 
through the m closest activity locations for activity k. This modified measure would 
represent a total sum of distances to the closest set of activity points. Then, a com- 
parison of accessibility at various locations across a landscape would be meaningful. 

In many public services, especially emergency services, accessibility is measured in 
terms of the closest available server or location. Access, then, is not based upon a 
gross number of opportunities and their proximity, or on personal choices, but on 
how far the closest server is from a given i. This accessibility measure has been used 
aS a proxy or surrogate measure of effectiveness in the location of public services 
(Hodgart 1978). We can formalize this using the previous notation as: 


= dy (3) 


where dy, is the closest location to i having a large enough amenity of activity k to 
serve a person at i with access type /. It should be recognized that the closest activity 
k to a location i may differ among those of different access types. This acce sssibility 
measure has been formalized in the p-median location problem where an activity is 
located on a network among p-sites out of n sites in order to maximize acce ssibility. 
(Hakimi 1964; ReVelle and Swain 1970). Whereas most other accessibility measures 
increase in value with increasing accessibility, this measure decreases in value as ac- 
cessibility increases. Thus, when using this type of accessibility measure in urban de- 
sign, the sense of optimization would be to minimize. 

We know that people do not necessarily choose to go to their closest available ac- 
tivity when given a choice. The size or attractiveness of the activity site, the distance, 
and available travel modes all play a role in that choice. One of the models that has 
been used to distribute trips among various destinations is the gravity model. The 
gravity model has been used in a number of different types of applications to estimate 
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the interaction between one location and another. The gravity model estimates inter- 
action in terms of attractiveness of an activity at a given location j (e.g., number of 
jobs, size of retail space, etc.) and distance. Hansen (1959) was the first to propose a 
gravity-based accessibility measure for some activity k, in the following manner: 


=> 


jeMy 


where: 


B, = empirical constant representing the inhibiting effects of distance (time or ef- 
fort) on trip making for travel type / 


The above equation represents a gross measurement of accessibility for a person of 
type / starting at location or zone i with respect to some activity k. The higher the 
value of gross accessibility, the greater the access and number of opportunities for a 
given activity. Gross accessibility to a given activity (e.g., shopping for shoes) is the 
sum of all such activities within some maximum time, distance, or cost limit, each dis- 
counted by how distant each activity is away from location i. An activity location is 
counted high when it is close, and low when it is far away. This discount is based upon 
how distance or time affects the possibility of making a trip to that location for that 
activity. Pooler (1987, 1995) presents a critical discussion on this type of “potential” 
accessibility model. We have taken the liberty of extending the accessibility measure 
suggested by Hansen by adding subscripts for the type of activity (k) and the type of 
person (/). it makes sense to re present different types of people, based upon their 
mobility status (Hanson 1995; Marston, Golledge, and Costanzo 1997). For example, 
if a sub- population at a zone can only move about by bus, then their accessibility to a 
given activity k from zone i will be different than those who own cars and can use ei- 
ther the bus or their car. Examples of such differences are given in Hanson and 
Schwab (1987). From a more rigorous perspective, one can generate a model of ac- 
cessibility related to (4) as: 


Oj 


jeMy 


where the distance attenuating function is of an exponential form. Although the dis- 
tance decay function may be expressed in many ways, the negative expone antial form 
is probably the more widely accepted form used. In the remainder of this paper we 
will use the simple distance decay form of equation (4), even though the form in 
— (5) is applicable as well. 

People make choices in travelling and destination. The gross accessibility measure 
sums up all possible opportunities (discounted by a function of distance) over a range 
of distance, time, or cost. However, it is unlike ly that each person actually visits e “ach 
activity site. More than likely, a choice will be made. Perhaps people with mobility 
type / may choose to go to one of two malls for shoe shopping, even though there are 
several nearby stores. The intervening opportunities model attempts to capture dif- 
ferent competing elements (like size and distance) in consumer choice (Stouffer 
1940). Even trip chaining may make a more distant activity site the most suitable 

(Hanson 1980). The simplest form of measuring accessibility based upon probabili- 
ties can be done in the following way: 


‘ 
| aud =) 
A,= 


88 / Geographical Analysis 


Piper (6) 


jeMy 


where p, = the probability that a person in zone i will go to zone j for activity k in- 
volving travel type |. This measure represents the average distance of travel to activity 
k as the distance to each site j times the probability that that site is the choice (Geert- 
man and van Eck 1995). One approach to estimating the probability of selecting a 
given location j can be to divide the accessibility potential of that site by the sum of all 
of the accessibility potentials generated at i for an activity k. 


> Ond,? 


reM, 


Wang (2000) used equations like 6 and 7 to estimate job accessibility by transporta- 
tion analysis zones in Chicago. We call this a probabilistic or choice- based accessibil- 
ity measure. Since the sum of the probabilities over all j should equal 1, this model 
does not “double count” activities, but only accounts for those activity locations actu- 
ally used. This at first may seem trivial, but consider the following real example. In an 
Agency for International Development project involving the location of health clinics 
in Colombia, Church (1979) discusses a situation where a clinic located in one village 
would not necessarily serve the inhabitants of a nearby village if there was a major 
cultural or political divide. A gross accessibility measure applied to the village would 
rightly consider it accessible. A choice accessibility model applied to the village would 
indeed c apture the fact the nearby clinic would be “tabu.” 

It makes sense that individuals might travel based upon making those choices that 
maximize their net benefits, or simply put, maximize their consumer surplus. An ac- 
cessibility measure for a given location i directed at some activity k could be a model 
that estimates the benefit associated with making the choice of location j that maxi- 
mizes their benefit. Alternate structures are possible, like the sum of net benefits over 
all feasible choices (see, for example, Miller (1999) who provides a framework for 
measuring accessibility benefits within transportation networks). 

Finally, it is important to note that accessibility can be measured in an absolute for- 
mat. For example, access is either provided or it isn’t. Either a building can be ap- 
proached and entered by a person using a wheelchair or it can’t. As stated earlier, 
absolute access is commonly used as a standards-based approach for measuring phys- 
ical access. Either bathrooms have been retrofitted or built for wheelchair access or 
they haven't. As long as there is at least one accessible activity location for each activ- 
ity type, for a given location i, absolute access has been provided and deemed accept- 
able. It makes sense, however, to use the other access measurements for people with 
disabilities (like closest available, gross access, and probabilistic choice access) in ad- 
dition to absolute access as figures of merit for measuring the effectiveness of build- 
ing retrofit plans, urban design, and new building layouts. For example, a ramp at the 
rear of a structure may provide absolute access, but it may make all people who need 
such a ramp to go out of their way as compared to a typical user. That is, absolute ac- 
cess doesn't capture the qualities ‘of access measurements given above. 

In the next section, we focus on the measurement of access for multiple activities. 
Although, all of the measures discussed in this section can be described within a mul- 
tiple activity set, we will focus on gross-potential, probabilistic choice, and closest dis- 
tance. 
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3. ACCESSIBILITY MEASURES FOR MULTIPLE ACTIVITIES 


When measuring accessibility, it is important to look over the range of possible ac- 
tivities. That is, individuals don’t just shop for shoes, but they go to the movies, visit 
the doctor, attend religious services, etc. They may have daily or weekly routines that 
represent a range of activities. The accessibility of various activities will differ, but for 
an individual, what is most meaningful is the totality of activity accessibility. To cap- 
ture this consider the following equation of Total Gross Accessibility: 


Ay =D (8) 
kj 


whe re: 


Ay = the total of Gross accessibility over all types of activities for an individual at i 
of type /. 


Adding all accessibility measures for a given location and access type represents the 
grand total of all gross accessibility measures. There is the potential to oa scaling 
problems as some activities are more important than others. To accommodate for 
this, we can weight each activity by some measure of importance. For example, let's 
consider the frequency of need (e.g., trips per year) for an activity as an importance 
weight for that activity. Thus, we can define Weighted Gross Accessibility as: 


Ay = (9) 
A j 


fi = frequency of trips (e.g.,trips per year) for a person of type ! located at i going 
for activity k. 


This measure is appealing in that it estimates total accessibility for a location and 
type of access. Just as there can be different types of people in terms of mobility, 
there can be different types of people based upon the frequency of use of specific ac- 
tivities. For example, an older person may visit the doctor more often, go to school 
less often, and volunteer their time at agencies more often than a younger person. A 
person using a wheelchair may make less frequent or more frequent trips than an- 
other person with a different mode of access for a given activity. This is still simplistic, 
given that people make multi-stop trips and are confined by a space-time prism. Ex- 
tensions to this simple gross estimate can be made by accounting for such behavior 
and constraints (e.g., Arentze, Borgers, and Timmermans 1994; Southworth 1985; 
Recker, Chen, and McNally 2001). 

We can also extend the concept of total accessibility for a probabilistic choice 
model as well: 


Ay = X ijl (10) 
A j 


This form also sums up accessibility over different types of activities, each weighted 
by the frequency of activity engagement. Again, trip-chaining behavior is not consid- 
ered in this simple measurement. 
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Finally, within a public service context we can create a measure of total access for 
all activities assuming that each individual is served by their closest server of a given 
activity type, and weighting each activity by frequency of need or use. By taking the 
closest available activity access measure and summing over activities weighted by fre- 
quency, we get: 


Ay l l ) 
k 


This measure represents total accessibility as weighted closest available service for 

each activity. This is often called weighted distance and is the form of accessibility 
that is used in the multi- -type facility p-median proble m (see the nested hierarchical 
and non-nested hierarchical p-me dian problems of Weaver and Church 1991). 


4. RELATIVE ACCESSIBILITY 


There are true differences in access based upon the type of individual, |, being ad- 
dressed. Golledge (1993) discusses the idea that even when considering the exact 
same geographic space, people with differing abilities must use, access, and travel 
through that environment using different routes, such that the conception and use of 
that space is “transformed” for different users. He called for measures that would 
help explain how people with disabilities access obstacle-ridden space (Golledge, 
1994). Let us consider the measurement of accessibility from an office in a university 
building to the closest coffee cart or café. Let's consider two people, one who uses a 
wheelchair and the other quite ambulatory, each leaving the same office (2843 Elli- 
son Hall on the UCSB campus) for coffee. This second floor office is close to a stair- 
well that is on the most direct route to the closest coffee cart (see figure 1). The 
ambulatory person leaves the office, goes down one flight of stairs, and exits the 
building on the east entrance, bounds down a few more steps, and then makes a bee- 
line for the coffee cart across the plaza, east of the building. The person using the 
wheelchair heads to the bank of elevators in the center part of the building, waits for 
an elevator, and then takes it to the first floor. That person exits the building through 
the front (south facing) doors, and then heads east around the building to the plaza, 
and then towards the coffee cart. Thus, they both have chosen and reached the same 
cart. The routes for both individuals are depicted in figure 1. 

For both people the cart is accessible in terms of an absolute measure. In fact, each 
measure described above for a single activity accessibility measure could be used. For 
= example, it makes the most sense to apply the closest available access measure, 

, the distance, time, or effort it takes to travel to the coffee cart from the office. For 
ie ambulatory person, the route takes approximately 40 seconds. For the person 
using a wheelchair, the route is longer as it involves a less direct route and an elevator 
wait and ride. Such a trip | is approximately 3 minutes and 30 seconds. The effort taken 
in terms of time spent is 5.25 times that of the ambuk itory person. We can express the 
differences in accessibility in terms of the person using a wheelchair, /, relative to the 
ambulatory person, m, leaving adjacent offices (call it location i ) in the following way: 


Rigi (12) 
ikm 


where k represents the activity of getting a cup of coffee for this example, and 


pelt 

= 
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DESTINATION 
Coffee Cart 


Walking route 
Wheelchair route 


Fic. 1. Traveling between office 2543 in Ellison Hall and the nearest coffee cart on the UCSB campus, 
where one individual is ambulatory and the other uses a wheelchair 


Rigi», = relative accessibility of activity k from location i for person type / relative to 
person of type m. 


If time is used then Ry, = 5.25 for our example of getting coffee. Relative access is 
an important measure because it relates the differences in access relative to individ- 
ual groups of users as well as it aids in understanding just what a physical impairment 
or constraint might represent in the effort needed to overcome obstacles in the envi- 
ronment (e.g., stairs and curbs) and travel. Relative access also helps to operationalize 
the space-time prism paradigm of Hagerstrand (1970) into an easy to understand and 
computable format. It is important to note that Lee and Lee (1998) have used the 
term relative access in a different way to represent the number of feasible competing 
routes to a destination and the number of mode changes. A similar notion, called 
route factor, was raised by Hay (1973) in analyzing the dire sctness of routes between 
cities. De Jong and van Eck | 1996) have used relative gross access and relative count 
access as access potentials and presented example maps for a region around Utrecht. 
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Marston and Golledge (2000) have used this type of time “penalty” to examine con- 
straints faced by visually impaired travelers using their regular navigation methods. 
They also tested the effectiveness of using an auditory signage system, and compared 
3 travel time to those of a sighted user (Marston 2002). Subjects were tested in their 
; relative ability in navigating and making five complex transfers between different 
| transit modes in an urban transit environment. 
Relative access can also be determined for a group or sum of activities. For exam- 
ple, let's assume that both people (in the above example) in performing their daily 
tasks during the work day, must travel three times to the photocopy machine (on the 
average), take two breaks that allow them to get a cup of coffee, travel several times 
to the bathroom, and attend several meetings within the building as well as go to 
g lunch. Summing up accessibility over these types of activities and frequencies, we can 
; obtain the following measure of relative access: 


k 


(13) 


Where Ry, = total relative access of person of type | compared to a person of type m, 
associated with location i assuming that person of type m has the same frequency of 
person /. Here relative access compares one class of individuals (access or mode type) 
to another based upon the same frequency of activities, but compounds the effects of 
spatial accessibility differences based upon the repetition of such activities 
Relative access measures can be used to help understand the impacts of different 
design alternatives in buildings and public areas. For example, constructing a wheel- 
chair ramp might provide absolute access to a building, but alternative placements of 
that ramp may differ considerably in terms of the impact on relative access. Whereas 
an absolute access measure can't distinguish between such positions, the relative ac- 
cess measure helps to measure the impact that the position of a ramp location has on 
the users. Suppose that a building already has an access ramp and is therefore acces- 
sible on an absolute scale and it also has at least one accessible restroom, water foun- 
tain, public telephone, etc. Then absolute access won't change if a second restroom is 
modified to be access friendly. But the gains in relative access might be considerable. 
That is, relative access is a good measure for determining the impact of barrier re- 
: moval across the urban landscape or network. 
= Up to this point, we have introduced the concept of relative access within the con- 
rl text of closest assignment and weighted closest assignment. Actually, relative access 
; can be defined for each of the access measures that have already been introduced. 
These include gross, probabilistic choice, closest assignment, and absolute as well as 
single, multiple, and weighted forms. When atte ntoe a modify the landscape to im- 
prove access for some segment of the population (e.g., ADA compliance) it is impor- 
tant to provide absolute access. In fact, the implied. measure that is used in access 
. compliance is relative absolute access. The issue is to provide absolute access over var- 
ious disabilities for all circumstances in which access is provided to those without dis- 
abilities. In essence, the objective of the ADA program has been to seek relative 
equivalence in terms of absolute access for various types of individual classes. Unfor- 
tunately, absolute access is a relatively crude form of access measurement and neglects 
the impact of the location of such barrier removals (e.g., lowering a public phone, 
making a restroom accessible, etc.) in terms of the movements required by those that 
use them. We argue that not only should access be a goal in terms of ADA compliance, 
but also that it be measured in terms of both absolute and relative access measures. 
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5. AN EXAMPLE IN MEASURING RELATIVE ACCESSIBILITY 


The above concepts should be relatively understandable without an example. But, 
a real example encountered in campus planning at the University of California at 
Santa Barbara (UCSB) should help to underscore the importance of measuring rela- 
tive accessibility. Even though we discuss this issue with respect to wheelchair users, 
we should point out that others pushing kids in strollers, paramedics pushing a gur- 
ney, courier personnel with a package cart, and campus audio-visual personnel push- 
ing TV carts also encounter the same type of physical barriers in moving across 
campus. This access proble m was presented to the campus ADA committee by one of 
the authors (Marston) in 1998. The analysis of relative penalty supported arguments 
made to make several improvements for the routes in question (i.e., an additional 
safety island for crossing the bikeway and making a route smooth so that it could ac- 
commodate those using wheelchairs). Figure 2 gives the layout of the central “core” 

of the UCSB campus. The Unive rsity Center, which houses the bookstore and a num- 
ber of food venders, is located in the bottom right portion of the figure. The Women’s 
Center is located north of the University Center, close to building 477. There are sev- 
eral possible routes that are given in the figure that can be taken in going from the 
University Center to the Women’s Center. A typical person would walk “Route A,” 
which heads directly north along the east edge of Storke Plaza, crosses a bikeway with 
safety islands, and then makes a slight jog to the entrance to the Women’s Center. The 
distance of this route is 650 feet. This heavily traveled route offers a bike crossing that 
splits the two lanes of traffic and offers a safety island so that people cross only one- 

way bike traffic at a time. Route A is the shortest walkable route and represents the 
route chosen by those who are relatively ambulatory. 


\ Parking 


DISTANCE 


DETOUR PENALTY 


seeeeee Rove A 650° 1.00 1 
RouleB 900’ 250° 138 1 
RouleC 1425 75 2.20 2 No/Yes 
RovtleD 2000 1350 307 1 


2. Comparing Routes and Access Between the Women’s Center and The University Center. 


\ » 5 } 
Snidecor Hall _ a 
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Unfortunately, the slope of the sidewalk along the east side of Storke Plaza is too 
steep for many people using wheelchairs. Even though we give this example associ- 
ated with wheelchair users, people using a manual wheelchair generally must look for 
another route. One could travel along the west side of Storke Plaza, (Route B), then 
turn and travel towards the Counseling and Career Center. After crossing the bike 
way and reaching a major east-west oriented path, they could then head east to the 
Women’s Center. This route is 900 feet or 250 feet longer than that of route A. In ad- 
dition, the bike path at this point has no divided lanes with a safety island, making 
crossing more dangerous and difficult. The relative accessibility for having to take 

900 
route B as compared to that of route A equals — = 1.38 (or 38% longer than route 


A). Even though this route is not too much longer than route A and does not have a 
steep slope, it had broken and cracked asphalt that made this route (Route B) im- 
passable for those using wheelchairs. Until this area was repaved and smoothed, peo- 
ple using wheelchairs had to travel east from the University Center to the end of the 
Music building, turn north, continue heading north until crossing the bikepath, and 
then head west to the Women’s Center. This route (Route C) adds 775 feet to the trip 
for a relative access measure (compared to those who are able to travel route A) of 
1425 
2.19 (iLe., = 2.19) or 119% longer than route A. People using this route also 


650 

have to cross the bikeway heading north, along the east side of the Music building, 
where there was no safety island, and then cross another bike ‘way close to the 
Women’s Center. To avoid this ve ry dangerous bikeway crossing along route C, a per- 
son could take Route D, which goes through the Arts building, heads west to 
Snidecor Hall, goes north across the divided bike path in front of the Performing Arts 
Theater, and then back east to the Women’s Center. This route adds 1350 feet to the 
trip as compared to Route A, for a relative access ge alty of 3.07, or 207% longer. 
Taking route D was considerably safer than route C. Until route B was repaved so 
that it was smooth and could allow wheelchair tr: fhe thes people using wheelchairs 
faced a relative access penalty of 2.2 or 3.07 compared to those who are ambulatory. 
This level of penalty is not insignificant and can have other adverse effects on access 
to opportunities. For ex: imple, it has been re porte «1 to one of the authors that a stu- 
dent at a different university, who had a mobility impairment, took three years to gain 
her MA compared to the standard two for her cohort; some re “quired classes could 
not be scheduled in the same term because class times were se parated by only ten 
minutes of travel time and routes did not exist between the classes that could be trav- 
eled by her in that time constraint. 

If a competitive route exists between an origin and destination that provides access 
to people with disabilities, then those people face no additional travel penalties as 
compared to a typical traveler. The greater the disparity between routes available to 
the ambulatory and those who have disabilities, the greater the difference in relative 
access. Measuring access only in terms of an absolute access measure does overlook 
the fact that significant disparities in access may remain between those with an im- 
pairment or limited mobility and the typical user. 

An accessible route is supposed to coincide, when feasible, with the route for the 
general public. In many cases it is either not possible or it is cost prohibitive to make 
the route used by the general public entirely accessible to all people. We believe that 
the relative access measures proposed in this paper should be used to help make the 
most cost effective decisions regarding the selection of routes between buildings, like 
those on a university c ampus, that should be made accessible when removing barvis TS. 

Hagerstrand (1970) introduced the notions of authority constraints and coupling 
constraints in describing the concept of the space-time prism. Coupling constraints 


4 
= 
5, 
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involve activities where more than one individual meets for an activity and authority 
constraints involve restrictions on access due to some social, political, or legal restric- 
tion on access. Measuring access for people of differing abilities must account for 
such conditions as well. For example, the scheduling constraints mentioned above in- 
volving the person with a mobility restriction hampered that individual from taking 
specific classes in the same term. Thus, authority conditions (e.g., published class 
schedules) can have a significant impact on relative accessibility. Forer and Huisman 
(2000) and Huisman and Forer ( 1998a, 1998b) discuss space-time accessibility issues 
within the construct of sequencing and schedules. Including sequencing and sched- 
uling conditions will, in general, exacerbate problems of relative accessibility be- 
tween people with differing abilities. Just as removing physical barriers to open up 
new routes of accessibility, it may also be possible to remove “schedule barriers” and 
improve absolute and relative accessibility by making changes to authority constraints 
(e.g., change published course schedules). 

It is important to recognize that relative accessibility may differ greatly among dif- 
fering spatial scales. For example, accessibility on a university campus differs from 
that of accessibility within a city. As overall travel distances increase for the ambula- 
tory, differences in relative accessibility for those with differing abilities will likely de- 
crease. Unfortunately, many daily activities (e.g., going to lunch or using a restroom) 
involve short trips for the ambulatory, but possibly time consuming trips for those 
with a disability. 


6. CONCLUSIONS 


Providing equal access to all is the goal of the ADA. This act has helped improve 
access to a variety of facilities, including public sector facilities such as ome li- 
braries, and courthouses and private sector facilities such as stores, motels, and the- 
aters. Even the availability of accessible restrooms plays a role in making the 
environment of a facility useful. We know that disparities in access will continue to 
exist, but as money is spent on renovation, re »mode ling, and re moving x barriers across 
the urban landscape, attention should be directed towards making cost effective de- 
cisions, decisions that will help make the greatest improvement in overall accessibil- 

ity. We have proposed that a relative access measure be used in conjunction with the 
more traditional measure of absolute access to help make such decisions. We have 
given several examples that show the relative penalty in comparing those using a 
wheelchair to an ambulatory person. We hope that this discussion and proposal will 
lead to further research as well as better decision-making. 
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